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@ Co-expression in eukaryotic cells. 

CM 

^© A method of producing proteins of interest is disclosed. The method includes the introduction of a first DNA 
^sequence encoding the protein of interest and at least one additional DNA sequence encoding a protein which 
^procasses or stabilizes the protein of interest into a eukaryotic host cell. The host cell is subsequently cultured 
Sunder conditions which allow tne DNA sequences to be expressed. Suitable eukaryotic hosts inciude mammalian 
cells and yeast cells. 
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CO- EXPRESSION IN EUKARYOTIC CELLS 



T'*e present invention -eiates generally to the procucricn of orctems. arc rrore particularly to the 
pr:cLc::cn of croteins in oic;og;ca:!y active ?crm ana ;n econcmicaily feasible amounts. 

Eacxground of the Invention 



Recombinant DNA technology ^as been used to produce a variety of proteins of therapeutic or other 
rcccrr.ic value, including enzymes, growth factors and peptide hormones. These proteins have been 
o:0'-iced m bacteria, fungal :e is and. -more recently, cultured mammalian oe!is. Because single-celled 
organisms cannot correctly process many human proteins, it has often been necessary to use cultured 
mammalian ceils to make :hese orctems 

Mammalian ceils can be transferee ;c express cloned DNA by well-estaolished laboratory procedures. 
Ho.vever. not all mammalian ceii types will efficiently express transfected DNA sequences, and cells which 
■"ave ceen shown to ce af ; :c;ent excresscrs of one transfected sequence will ;n other oases produce only 
cw evels of different gene procucts. Low expression levels for transfected genes may result from 
degradation of the protein product ;ntra ano or extracellularly, production of inactive form(s) of the protein or 
production of formCs) of the orctein tnat are cytotoxic. Low levels of protein or activity may also result from 
an unstable mRNA seauence, proteolytic activation or inadequate, inefficient or improper processing by the 
host cell. Processing steps wmch may be necessary for the activity or secretion of a newly synthesized 
protein include specific crcteo!ytic cleavage, subunit polymerization, disulfide bond formation, pcst-tran- 
3 atiOnal or co-translationai mccification of certain amino acids and giycosylation. 

These problems in protein production reflect the specialized nature of cells derived from higher 
cganisms. Mammalian ceils cerivec from a particular tissue may not properly produce a protein that is not 
normally made by that tissue. In addition, mammalian cell lines adapted to grow in culture are derived from 
turners or are otherwise aoncrmal. often : eading to unpredictable protein processing. 

For example, a number of ^esearcn groups have produced human coagulation factor IX in cultured 
mammalian cells (Kaufman et al.. J. Biol. Chem. 261:9622-9628, 1986; Anson et al.. Nature 315:683-685, 
1385. Hagen et al.. EP 200,^21 "guscv et al.. Nature 316:271-273. 1985). Desoite efforts 7o~maximize 
orjojction of biologically active crctem through the use of strong promoters, enhancers, increased gene 
cocy number, etc, and despite the relatively high levels of factor IX mRNA observea, levels of active factor 
"X proGuced by these transfected ceil lines do not exceed about 5 ug/mi of cell cuiture medium. In some 
:ases orecursor forms of factor IX are made but mature protein is ineffectively secreted from the host cell. 

^'Oblems with protein crocuct;cn have previously been dealt with by experimenting with a number of 
different cell types and by selecting and screening a large number of isolates of a particular transformed or 
t'arsfected strain or cell line. Such an aporoach is extremely labor intensive and carries no assurance of 
success. Consequently, there is a need in the art for a method of systematically and predictably producing 
■•ectmcinant cells which can express orotems of interest in an active form in economically feasible amounts. 
T ~9 oresent invention provices sucn a system, and further provides other related advantages. 

Disclosure of Invention 



3r;efly stated, the present invention orovides a method of producing a protein of interest comprising the 
sreos of 'a) introducing into a eukaryotic host ceil a first DNA sequence encoding the protein of interest and 
at 'east one additional DNA sequence, the additional DNA sequence encoding a protein which processes or 
stabilizes the protein of interest; (b) culturing the host cell under conditions which allow the first DNA 
sequence and the additional DNA sequence(s) to be expressed; and (c) isolating the protein of interest from 
the host celL The step of introducing the DNA sequences into the host cell may be through (a) 
c oransrecticn or cotransformation with multiple vectors, each containing a separate expression unit; or (b) 
transfeciion or transformation with a single vector containing multiple expression units. When the host cell is 
2 ~ammalian host cell, the step of introducing may aiso be through transfection with a single vector 
; reaming a single expression unit that is transcribed into a poiycistronic message. When yeast host cells 
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are utilized, a preferred method for introducing the DNA secuences comprises (a) transforming the yeast 
nost cell with a singie expression unit containing the additional DNA sequence(s); (b) isolating host ceiis 
whicn stably produce the processing or stabilizing activity; and (c) transforming the isolated host ceils with 
the first DNA sequence encoding the protein of interest. Preferably, the initial transforming step results in 

5 the integration of the single expression unit into the yeast host cell genome. 

Preferred first DNA sequences include those encoding plasma serine proteases such as t-PA, factor VII, 
factor IX, factor X, protein C and plasminogen. Preferred additional DNA sequences include those encoding 
proteases, protease inhibitors, gamma-carboxylase, and proteins which bind to the protein of interest. 

In another aspect of the present invention, eukaryotic host cells into which a DNA sequence encoding a 

10 protein of interest as described above and one or more additional DNA sequences encoding a protein or 
proteins which process or stabilize the protein of interest as described above have been introduced are 
disclosed. Preferred eukaryotic host ceiis include mammalian host cells and yeast host cells. 

75 Brief Description of the Drawings 



Figure 1 illustrates the construction of the plasmid Zem99. 
Figure 2 illustrates the construction of the t-PA expression vector Zem2l9. 
Figure 3 illustrates the construction of Zem228. 
20 Figure 4 illustrates the subcloning of the alpha- 1 -antitrypsin cDNA. 

Figure 5 illustrates the construction of expression vector Zem235. 

Figure 6 illustrates the nucleotide sequence of a plasminogen cDNA, together with the encoded 
amino acid sequence. 

Figure 7 illustrates the production of plasminogen in transfected BHK cells (a), and in transfected 
25 BHK cells co-expressing alpha- 1 -antitrypsin (b). [Lanes 1 -media samples; lanes 2-cytoplasmic extracts. 
Arrows indicate the position of intact plasminogen (92 kd)]. 

Figure 8 illustrates the construction of the vector pD5. Symbols used are: 0-1, the adenovirus 5 0-1 
map unit sequence; E. the SV40 enhancer; MLP t the adenovirus 2 major late promoter; L1-3, the adenovirus 
2 tripartite leader; 5 , 5 splice site; 3 , 3 splice site; p(A) T polyadenylation signal; DHFR, dihydrofolate 
30 reductase gene. 

Figure 9 illustrates the construction of the vector pDX. Symbols are used as set forth for Figure 8. 
Figure 10 illustrates the construction of expression vectors containing the S. cerevisiae KEX2 gene. 
Figure 11 illustrates the results of an assay for activated protein C~o"n media samples from 
transfected 293 cells. 

35 Figure 12 illustrates the anticoagulant activity of activated protein C prepared in transfected mamma- 

lian celts. 

Figure 13 shows the results of a radioimmune precipitation of factor VII produced by celts co- 
expressing factor IX (lane 1) and by factor Vll-transfected control cells (lane 2). The arrow indicates the 
position of single-chain factor VII. 

Figure 14 illustrates the construction of a yeast expression vector containing a DNA sequence 
encoding a thrombin-cleavabie fusion protein. "Ml-3" indicates the insuiin precursor DNA sequence. 

Figure 15 illustrates the nucleotide sequence of a synthetic aprotinin gene. 

45 Best Mode for Carrying Out the Invention 

Prior to setting forth the invention, it may be helpful to an understanding thereof to set forth definitions 
of certain terms to be used hereinafter. 

50 Stabilize: The term stabilize is used herein to denote the protection of a protein from degradation. 
Stabilization may proceed by a number of mechanisms, including inhibition of a proteolytic enzyme which 
would otherwise degrade the protein of interest, binding to the protein of interest to protect it from a 
proteolytic enzyme, and binding to or otherwise inhibiting the action of a cofactor or other molecule required 
for the activity of a protease. 

55 

Process : As used herein, "process" means to modify the structure of a protein. Processing of proteins 
includes such modifications as specific proteolytic cleavage to produce a multi-chain protein or remove 
peptides from protein precursors; mocification of amino acids, including carboxylation and hydroxylation; 
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i" ~ torccnyorate adoption a: scec.f'C sites, recessing may oe necessary for fu: : o;C:CG:C£i ac:r/:;y of a 
"* item or may ce :ec:-. r ec to enac'.e the secretion of a orctem from a ce.i 

~ 2r Erection and Transformation: T u e orccsss of mtrocucing cored DNA into nest ceiis. T'ansfecticn refers 
:o r sertirg ON A into mammae an call lines. Toe orocess of inserting DNA into fungal ana cactenal ceils is 
- as transformation. A rumcer of transferror and transformation procedures are <ncwn n me art. 

As noted above, cells containing cicnea DNA sequences do not always prccuce proteins encoded by 
tnese cicnea seauences at economically feasible levels or in biologically active form. This is often due to 
me origin of the ceil or its abnormal nature. ! n many instances ;t may not be possiole to obtain a cultured 

*o oe. i ine or host' ceil strain with the necessary characteristics to enable it to produce a particular protein at 
the desired level. In any event, screening a large number of potential host cell lines may net be feasible and 
carries no assurance of success. 

The present invention overcomes the shortcomings of available cells by providing a method of 
deducing into a eukaryotic host cell a gene or cDNA encoding a protein of interest together with one or 

•= —ore additional DNA sequences encoding a protein or proteins which process or staoiiize the orotein of 
■•Me r est. Processing croteins include oroteases which cleave a orecursor protein at a particular site to 
ore vice the mature and/or active form of the protein, for example peptidases which remove signal peptides 
or orcpeptides, or which cleave a single-chain polypeptide to a multi-chain form. Other examples of 
orocessing proteins are those which modify amino acids, such as gamma-carboxyiase. an enzyme which 
modifies specific glutamic acid residues of certain clotting factors and other calcium binding croteins. Other 
or Massing proteins include enzymes responsible for the conversion of aspartic acid to ,imyarcxy asDartic 
acio. a modification necessary for the activity of protein C; those responsible for the addition of carbohy- 
drate chains to glycoproteins; and those responsible for mynstoylation, C-terminal ammo acid removal, 
hycrcxylation of proline residues, sulfation and C-terminai amidation. Stabilizing proteins mciude protease 

:s inhibitors which block the proteolytic degradation of the protein of interest; proteins wnich bind to the 
orotein of interest making it unavailable as a substrate; proteins which bind to protein cc-factors, ions, or 
otner molecules reauired by a protease; and Droteins which inactivate co-factors, it will be appreciated that 
a oarticular eukaryotic host cell can be transfected or transformed to produce several processing proteins, 
several stabilizing proteins, or a combination of stabilizing and processing proteins. 

30 The present invention is based, in part, on the unexpected discovery that a variety of processing 

orotems wiil function outside their native environments. For example, it is disclosed herein that factor IX. an 
enzyme which normally functions in the blood, can activate factor VII intraceiluiarly. Additionally, it has been 
found that the product of the yeast KEX2 gene functions normally within mammalian ceiis. The observed 
orocessing indicates that both the protein of interest and the processing protein are unexpectedly targeted 

;5 to toe same cellular compartment. The observed function of these processing proteins is also surprising in 
yiew of the fact that the proteins of interest may not be produced in large amounts or in an intact or active 
form in a recombinant cell. 

By characterizing a naturally occurring orotein of interest and. as necessary, the gene or cDNA 
encoding it, one can deduce the nature of the processing steps involved in its biosynthesis. Characterization 
:; me recombinant form of the protein will reveal whether or not it has been correctly processed and, if not, 
/mat processing steps were omitted. According to the present invention, proper processing is provided 
trmugh supplying the missing activity or augmenting a limiting activity. This activity may be suDplied as the 
orotein which normally processes the protein of interest, or as a related protein which normally performs a 
s;m: ar function in another context, such as a protein from a different cell type. An example of the latter case 

-'5 s the -ise of the yeast KEX2 gene to supply the processing protein which cleaves protein C or an activated 
protein C precursor to the two-chain form. Blocks in the secretion of a protein of interest can be determined 
by ;haracterizing the intracellular and secreted forms of the recombinant protein. In this way, the missing or 
tmitmg processing step is determined. 

in some instances one will know the identity of the protein supplying the missing activity. In this case, 
me desired gene or cDNA is cloned and introduced into the chosen host cell. 

When the protein responsible for the missing activity is not known, the protein of interest is analyzed, 
the nature of the missing processing step is determined, and a protein known to perform that function is 
selected. A suitable protein may be selected from known and available DNA sequences encoding proteins 
with similar activity. Many such processing proteins have been characterized. For example, Kettner and 

55 Shaw / Meth. in Enzymotogy 80:826-842, 1981) have characterized the specificities of a number of 
oroteases. Alternatively, a DNA sequence encoding the needed processing protein may be identified by 
'ransfecting cells with a mammalian expression library and selecting those that exhibit the needed activity. 
/Vhere protein stabilization is desired, ceils transfected or transformed to produce the protein of interest 
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are grown m :he oresence :i v = r:cus s:ac-. lizmg proteins and assayed for oroouction of the intact protein cr 
interest. Generally, the ceiis .viil be labeled with a radioisotope and proteins will be analyzed by 
radioimmune precipitation ana gei electrophoresis, or by other conventional techniques. The presence of 
the intact protein of interest in :he culture mecium indicates that the stabilizing protein is protecting the 

5 protein of interest from aegracaticn. Stabilization may also lead to phenotypic changes in the host ceil. For 
example, protease activity may cause cell detachment, which may be corrected by inhibiting that activity 
Correction is indicatea by the oresence of the normal (adherent) phenotype. 

When the desired processing or stabilizing protein has been identified, the DNA sequence encoding it 
and the DNA sequence encoding the protein of interest are introduced into chosen host cells as described 

ro below. Cells expressing the introduced sequences are selected and screened for production of the protein 
of interest in the desired form or at the desired level. Ceils which satisfy these criteria are then cloned and 
scaled up for production. 

Proteins of interest which can be produced using the methods of the present invention include a variety 
of plasma serine proteases, such as coagulation factors VII, IX and X, activated factor VII (designated factor 

75 Vila), activated factor X (factor Xa). orotein C, activated protein C, protein S, tissue plasminogen activator (t- 
PA), plasminogen, and analogs and derivatives of these proteins, although virtually any protein of interest 
could be produced. The methods are oarticularly suited to the production of these serine proteases due to 
the post-translational processing necessary for their activity and/or secretion by the host cell. 

According to the present invention, coagulation factors requiring gamma-carboxylation of specific 

20 glutamic acid residues for their biological activity may be secreted at high levels by cells into which has 
been introduced a DNA sequence encocing gamma-carboxylase. The gamma-carboxyiation step has been 
found by the inventors to be limiting in mammalian cell lines commonly used for the production of 
recombinant coagulation factors. Furthermore, the invention may permit the production of biologically active 
gamma-carboxylated proteins in non-mammalian host cells such as yeast cells. 

25 Coagulation factor Vll may be made in activated form by transfecting cells with DNA sequences 

encoding factor Vll and factor IX. The precursor form of factor Vll is activated by the factor IX and factor 
Vila is secreted by the ceils. Alternatively, a protein having the biological activity of factor Vila may be 
produced by co-expressing a DNA sequence encoding a derivative or analog of factor VI! and a DNA 
sequence encoding factor IX. DNA sequences encoding derivatives and analogs of factor VII are described 

30 in pending U.S. Patent Aophcation Serial Nos. 724,311 and 810,002, as well as published European Patent 
Application EP 200,421, which are herein incorporated by reference. As the activated form of factor Vll has 
been shown to have therapeutic value, it would be desirable to produce the activated form directly. Direct 
production would increase yields, reduce the number of production steps ana eliminate problems asso- 
ciated with activating blood products. 

35 In another embodiment, a DNA sequence encoding protein C, activated protein C or a modified protein 

C or activated protein C precursor is inserted into a cell line which has been transfected to express the 
yeast KEX2 gene. This gene encodes an endopeptidase which cieaves after a dibasic amino acid sequence 
(Fuller et al., in Leive. ec Microbiology: 1986 . 273-278, 1986). Processing may be further enhanced by 
also transfecting the ceils with :ne yeast KEX1 gene (Dmochowska et al., Cell 50:573-584, 1987), which 

jo encodes an enzyme wmcn removes the basic amino acids from the C-terminus of "the protein C light chain. 
Modified protein C precursors are described in pending commonly assigned U.S. Serial Nos. 924,462 and 
130,370, and published Eurooean Patent Application EP 266,190, which are incorporated by reference 
herein in their entirety. Preferred orotein C and activated protein C precursors include the amino acid 
sequence R* -R2-R3-R4-X-RS-R::- Ry-Rg . wherein Ri-Rg are Lys or Arg and X is a peptide bond or a spacer 

45 peptide of from one to twelve, creferably from two to eight amino acids, at the cleavage site between the 
light and heavy chains. 

Activated protein C can also oe produced in a cell line which has been transfected with DNA sequences 
encoding an activated protein C precursor, thrombin and thrombomodulin. The thrombin cleaves the 
activated protein C precursor (two-chain form of protein C) in the presence of thrombomodulin, and 

so activated protein C is secreted by the cells. Activated protein C precursors are described in U.S. Patent 
Application Serial Nos. 924,462 and 130,370, and in published European Patent Application EP 266,190, 
which are incorporated herein by reference. 

Factor Vila and Factor IXa can be produced in a cell line transfected to co-express Factor X. The Factor 
X cleaves the Factor Vll or Factor IX, resulting in secretion of the activated clotting factor. 

55 In yet another embodiment, the production of t-PA or a t-PA analog or derivative is enhanced by 

transfecting a host cell line to produce both t-PA or a t-PA analog or derivative and a protease inhibitor. 
Suitable protease inhibitors in this regard include TIMP (tissue inhibitor of metailoproteases), trypsin 
inhibitors and aprotinin, with aprotinin particularly preferred. Analogs and derivatives of t-PA are described 
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" lacing ^ 3. Ser-ai "*:s 322.^25 zcresccnai^:; ;o c^c/.snec ^acanese Parent Aoc.icaticn 63.133.9S8.'. 

395, 353.412: 053.4*; 253.251: 253.217- ana 125.529 arc m Eurccean Parent Aoc ! :ca*;cn ?uc;icaticn 
■ «cs. '.95.920; 201.153 ara 240.334, /vn;cr. are inccrpcratea -erein by reference. Ex cession or t-PA at 

:: erciailv feasibie 's-veis "as ceen d:~ic~it because ;re serine crctease activity of t-PA results r 

:e;acoment of ceils from sue cart 3:- races. This necessitates tne use of agents such as aorotinin :n tne 
c.a. *T~e jse o* acro::n r :n ^ecia aisc a:ic.vs t u e crccuct:on of -he s:ng,e-crain : orm of t-PA. 
a :e£:rac:e merapeutic prcccct. however aprotinin is corn costiy ana limited in availability. 

Plasminogen can be crccucea in intact form using cells transfected to prcauce both plasminogen and a 
crctease inhibitor. Variants cf O:asmmogen (as aescricec in U.S. Patent Acciicaticn Serial No. 053.412) may 
aiso be produced. A particularly preferrea orotease inribitcr in this regard is alpha- 1 -antitrypsin, aithougn 
variants of alpha- 1 -antitrypsin (U.S. Patent No. 4,711,348) and other protease inhibitors may also be 
employed, intracellular plasminogen activation and subsequent degradation have limited the ability to 
produce recombinant plasminogen at reasonable levels. Inhibition of plasminogen activity by AAT would 
ameliorate this crobiem. 

Other examples of protem-stabiiizer combinations mcluae stacilization of protein C by co-expression of 
c: stem S. stabilization of factor VIII oy co-exoression of von Wiliebrand factor and stabilization of factor VII 
by co-expression of tissue factor. BiP (Haas and Wabl, Nature 306:387-389, 1983) may also be used to 
s'abiiize various proteins. A cDNA encoding BiP is described by Munro and Pelham (Cell 46:291-300, 
1986k 

DNA sequences useful m carrying out the present invention may be clcned by standard procedures 
-toa'p: m trie art. Genomic or cDNA sequences may be used. Many such clones have been described m 
the literature, including DNAs encoding t-PA (Pennica et al.. Nature 301:214-221, 1983), factor VII (Hagen et 
al 'bid. : Hagen et al.. Proc. Natl. Acad. Sci. USA 83:2412-2416, 1986)", factor IX (Karachi ana Davie, Proc. 
Natl. Acad. Sci. USA 79:6461-6464. 1982), plasminogen (Maiinowski et al., Biochemistry 23:4243-4250, 
:984- Fcrsgren et al.. FEBS Lett. 2T3:254. 1987), alpha-1- antitrypsin (Long et al.. Biochemistry 23:4828- 
4837. 1984; Kawasaki, U.S. Patent No. 4,559,311), protein C (Foster and Davie. Proc. Natl. Acad. Sci. USA 
31 4756-4770. 1984: Fester et al.. Proc. Natl. Acad. Sci. USA 82:4673-4677, 1985), prothrombin (Friezner- 
Degan et al.. Biochemistry 22:2087-2097, 1983), factor" VIMToole et al.. Nature 31^:342-347, 1984), von 
Willeorand factor (Lynch et ai.. Cell 41^:4956. 1985; Collins et al., Proc. Natl. Acad. Sci. USA 84:4393-4397, 
1987), tissue factor (Spicer et ai., Proc. Natl. Acad. Sci. USA 84:5148-5152, 1987) and factor X (Leytus et 
al.. Biochemistry 25:5098-5102. 1986). Additional clones may be~obtained by screening cosmia, genomic or 
cDNA libraries with oligonucleotide probes designed on the basis of amino acid sequence data or with 
cloned DNA fragments; or through the use of expression libraries which are screened with antibodies to the 
protein of interest (Young and Davis. Proc. Natl. Acad. Sci. USA 80:1194-1 198, 1983), by ligand blotting 
(S keia and Hahn, Proc. Natl. Acad. Sd. USA 84:3038-30427T987) or~by assaying for activity. 

The cloned DNA sequences are inserted into suitable expression vectors which are in turn used to 
trar.sfect or transform suitable eukaryotic host cells. 

Expression vectors for use in carrying out the present invention in mammalian ceils will comprise a 
promoter capable of directing :r.e transcnotion of a cloned gene or cDNA introduced into a mammalian cell. 
/ra ; promoters are preferred due to their efficiency in directing transcription. A particularly preferred 
promoter is the major late promoter from adenovirus 2. Other suitable promoters include the SV40 promoter 
Subramani et al., Mol. Cell Bioi. 1:854-864, 1981) and the MT-1 (metallothionein gene) promoter (Paimiter 
at a! . Science 222:809-814 '983). Such expression vectors may also contain a set of RNA splice sites 
:oa:ed oewnstream from the oromoter and upstream from the insertion site for the cloned DNA sequence 
or //'thin the cloned sequence tself. Preferred RNA splice sites may be obtained from adenovirus and/or 
mmunoglobulin genes. Also contained in the expression vectors is a polyadenyiation signal, located 
downstream of the insertion site. Particularly preferrea are viral polyadenyiation signals, such as the early or 
late polyadenyiation signals from SV40 or the polyadenyiation signal from the adenovirus 5 Eib region. 
E^cression vectors useful in carrying out the present invention may also comprise a noncoding viral leader 
sequence, such as the adenovirus 2 tripartite leader, located between the promoter and the RNA sDlice 
s tes. Preferred vectors may aiso include transcriptional enhancer seauences, such as the SV40 enhancer, 
ana translations! enhancer seauences, such as the sequences encoding the adenovirus VA RNAs. 

Vectors containing cloned DNA sequences may then be introduced into cultured mammalian cells by, 
for example, calcium phosphate-mediated transfection (Wigler et at., Cell 14:725, 1978; Corsaro and 
Pearson. Somatic Cell Genetics 7:603. 1981; Graham and Van der Eb, "Virology 52:456. 1973) or 
e ectropcration Neumann, EMBO J. 1:841 -845, 1982). A small fraction of the ceils integrate the DNA into the 
genome or maintain the DNA m non-chromosomal nuclear structures. In oraer to identify these integrants, a 
^e^e that confers a selectable onenotype (a seiectabie marKen is generally introduced into the cells along 
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'.vi th the gene of interesr. ^referred selectable marners include genes that confer resistance to druqs, such 
as neomycin, nygromycm. anc methotrexate. If calcium onosphate-meaiated transfecticn is used, selectable 
markers may be introduced into the ceil on a separate plasmid at the same time as the gene of interest, or 
they may be introduced on the same plasmid. If on the same olasmid, the selectable marker and the gene 

5 of interest may be under the control of different promoters or the same promoter, tne latter arrangement 
producing what is known as a dicistronic or polycistronic message. Constructs of this type are known m the 
art (for examcle, Eurooean Patent Office Publication No. 117,058, U.S. Patent 4.713.329). After the cells 
have taken up the DNA, they are allowed to grow for a period of time, typically 1-2 days, to begin 
expressing the gene of interest. Drug selection is then applied to select for the growth of ceils which are 

w expressing the selectable marker. When using methotrexate selection, increasing the drug concentration in 
a stepwise manner allows selection for increased copy number of the cloned sequences, resulting in 
increased exoression levels. Clones of such cells may be screened for production of the protein of interest. 
Useful screening methods include immunological assays and activity assays. 

Preferred mammalian cell lines for use in the present invention include the COS (ATCC CRL 1650), 

?5 BHK (ATCC CCL 10) and 293 (ATCC 1573) cell lines as well as derivatives and isolates of these cell lines, 
although other cell lines may be preferred for production of particular proteins. A preferred BHK cell line is 
the tk*tsl3 BHK cell line (Waechter and Baserga. Proc. Natl. Acad. Sci. USA 79:1106-1110, 1982), 
hereinafter raierre6 to as tk~BHK cells. Other useful adherent cell lines include RaT~Hep I (ATCC CRL 
1600), Rat Hep II (ATCC CRL 1548), TCMK (ATCC CCL 139), Human lung (ATCC CCL 75.1), Human 

20 hepatoma (ATCC HTB-52), Hep G2 (ATCC HB 8065), Mouse liver (ATCC CC 29.1) and DUKX cells (Urlaub 
and Chasm, Proc. Natl. Acad. Sci. USA 77:4216-4220, 1980). Useful suspension cell lines include AtT-20 
(ATCC CCL 89), MOLT-4 (ATCC CRL 1582), BW5147.G.1 .4.0UA ft .1 (ATCC CRL 1588), S194/5.XXO.BU.I 
(ATCC TIB 20), EL4.BU.1 .OUAM .1 (ATCC TIB 41), Sp 2<0-Ag14 (ATCC CRL 1581), J558L (Oi et al., Proc. 
Natl. Acad. Sci. USA 80:825-829, 1 983) and Raji (ATCC CCL 86). 

25 In general, a host ceil line will be selected on the basis of its ability to produce the protein of interest at 
a high level or its ability to carry out at least some of the processing steps necessary for the biological 
activity of the protein. In this way, the number of cloned DNA sequences which must be transfected into the 
host cell line may be minimized and overall yield of biologically active protein may be maximized. However, 
the present invention allows one to produce virtually any protein in practically any ceil line which can be 

30 cultured in vitro. 

DNA sequences encoding the protein of interest and the processing and/or stabilizing protein(s) may be 
introduced into the cell on the same vector or on different vectors. It is preferred to use a singie vector with 
one selectable marker in crder to minimize problems which can result from marker instability. Genes or 
cDNAs on a vector may be controlled by separate promoters or by a single promoter. In the latter 

35 arrangement, which gives rise to a polycistronic message, the genes or cDNAs will be separated by 
transiational stop and start signals. When transfecting with a large number of DNA sequences, practical 
limitations on vector size may necessitate the use of two or more vectors, each with its own selectable 
marker. Two or more vectors may, of course, be used whenever co-expression of a protein of interest and a 
stabilizing cr crocessing protein is desired. 

■io Other eukaryotic cells, including yeast and filamentous fungi, may also be used within the present 

invention. These lower eukaryotic hosts provide certain advantages over mammalian cell lines, including 
ease and economy of culturmg and existing industrial fermentation capacity. By manipulating cells as 
described herein, fungai cells and other eukaryotic cells capable of expressing clonea DNA sequences can 
be used to produce virtually any protein of interest. 

^5 For example, cells of bakers' yeast ( Saccharomyces cerevisiae) can be transformed with cloned foreign 
DNA sequences and cultured to high cell densities, and will exoress the clonea DNA and secrete the 
foreign proteins. In some instances, however, the foreign proteins are not secreted or are inactive due to a 
lack of proper processing or proteolytic degradation. Yeast cells cannot, for instance, gamma-carboxylate 
proteins or add mammalian-type complex carbohydrate chains to glycoproteins. According to the present 

so invention, this lack of processing can be overcome by transforming yeast host cells with DNA sequences 
encoding the missing prcteinfs) (e.g., gamma-carboxylase, glycosylating enzymes). Degradation of foreign 
proteins may be overcome by transforming the cells to produce protease inhibitors or proteins which bind 
to the protein of interest. An example of a suitable binding protein is BiP. Binding proteins may also 
enhance secretion of a foreign protein by altering its conformation. In addition, yeast cells can be 

55 transformed to produce foreign proteases, enabling them to produce and secrete active forms of foreign 
proteins (e.g., mammalian serine proteases) which require specific cleavage for secretion and/or biological 
activity. Junction points between secretory peptides and mature proteins may be engineered so that 
cleavage cy a co-expressed protease (e.g., thrombin) releases the mature protein. 
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E^arvcnc micrccr-ar.sr^s. s-cr* as t~e yeast Saccrarcmyces cerevsiae, cr : 1 iar.en::.s 'urg: mci^cing 
-3cerg;.us soec:es. .-ray ce trars'crmec according :c <rcwn crccecjres. Ascergjiius scec;es may be 
■-'ar£rc r meo, ; or examcie, according to the metnoa of Yelton et al. fProc. Natl. Acaa. Sc;. USA 81:1740- 
' '984). Particularly ore'e^rec scec:es cf Asce r g:Hus .ncluce A. ^idcTans. A. niger. A. pryzae - and A. 

r e:.s Techmoues for transferring yeast are aescncea. for example, oy 5 eggs (Nature 275:1 04-1 08^ 
"9~5: E/oression vecicrs :cr ^se ;n yeast :-c:uoe YRo7 fStruhl et al.. P'cc. Natl. Acad. Sci. USA 76: 1 035- 
" :39 1979), fEcl3 <'3rcacr. et ai.. Gene 3:121-133. 1979), pJDB243 ana djCB21 9 Iteggs ibia). and 
derivatives thereof. Such vectors will generally ccmcrise a selectable marker, such as the nutritional marker 
TRPl , <vnich allows selection ;n a host strain carrying a trpl mutation, or the POTT selectaole marker, which 
permits selection in a tpi-strain grown in rich medium (Kawasaki and Beii, E? 171,142). Preferred promoters 
f or use in yeast expression vectors include promoters from yeast glycolytic genes /Hitzeman et al., J. Biol. 
Chem. 255:12073-12080, 1980; Alber and Kawasaki, J. Mol. Appl. Genet. 1:419-434, 1982; KawasakT TTS. 
Patent No. 4,599,311) or alcohol dehydrogenase genes (Young et al., in Genetic Engineering of Microorgan- 
'srs for Chemicals , Hollaencer et al.. etis.. o. 335, Plenum. New York, 1982; and AmmeTer,, Meth. m 
Enzyme logy 101:192-201. 1983). To facilitate secretion of a protein of interest produced in a yeast 
"arsrorrrant and to obtain oroper processing, a signal sequence will generally be provided. Preferably the 
signal sequence will be ootained from a yeast gene encoding a secreted protein. A particularly preferred 
s-grat secuence is the pre-orc region of the MFqI gene {Kurjan and Herskowitz, Ceil 30:933-943. 1982: 
Kuran et aL, U.S. Patent 4.546.082 and Singh, EP~123.544). 

Co-expression of DNA seauences in yeast may be achieved in several ways. It is preferred that the 
-e/cression unit for the processing or stabilizing protein be integrated into the host ceil genome and an 
isoiate which stably produces the processing or stabilizing activity be selected. An expression vector 
containing the DNA sequence for the orotein of interest is then transformed into the host strain. Alter- 
natively, the two expression units may be on different autonomously replicating expression vectors with 
different selectable markers or on a single expression vector. 

In a preferred embodiment, a DNA sequence encoding a foreign processing protein is inserted into a 
DNA sequence encoding a yeast protein having a similar function. The insertion is designed to substitute 
the ; c r eign sequence for yeast seauences encoding the processing ; unction(s) of the yeast protein. A 
particularly preferred such yeast protetn is the KEX2 gene product. This protein has been analyzed and its 
catalytic and other domains have been characterized (Fuller et al., Yeast Cell Biology. Cold Spring Harbor 
Laboratory, Cold Spring Haroor. New York, 1987, page 181). The resulting hybrid sequence, comprising the 
foreign DNA sequence and yeast sequences encoding transport and cellular localization domains, is then 
joined to a yeast promoter and terminator. Preferably, the promoter is a strong promoter, such as the TPI1 
promoter. This construct may also contain flanking sequences to target the expression unit to a particular 
r.tegraticn site in the host cell genome. 

Proteins produced according to the present invention may be purified from cell-conditioned media by a 
/anety of procedures known in the art, which may be selected according to the physicochemical 
:-aractenstics of the particular orotein. Suitable methods include affinity chromatography, ion exchange 
chromatography, gel filtration, mgn performance liquid chromatography, and combinations of these meth- 
ods. 

Proteins produced according to tne present invention may be used within compositions for pharmaceu- 
tical, industrial, research and other purposes. For pharmaceutical use, the proteins will generally be 
combined with a physiologically acceptable carrier or diluent, such as sterile water or sterile saline, and 
packaged in individual doses in sterile vials. Alternatively, the proteins may be packaged in lyophilized form 
anc -eccnstituted prior to administration. Administration will generally be by injection or infusion. Phar- 
maceutical compositions may further contain additional proteins or other compounds of therapeutic value, 
adjuvants, local anesthetics, etc. 

The following examples are offered by way of illustration and not by way of limitation. 

EXAMPLES 



Except where noted, conventional molecular biology techniques were employed. Restriction en- 
donucleases and other DNA modification enzymes (e.g., T4 polynucleotide kinase, calf alkaline 
croscnatase, DNA polymerase ! Klenow fragment, T4 polynucleotide iigase) were obtained from Bethesda 
Research Laboratories, New England Biolabs cr Boehringer-Mannheim and used according to the suppliers' 
■rstr-jctions or as described in Maniatis et ai.. eds.. Molecular Cloning: A Laboratory Manual. Cold Spring 



8 



EP 0 319 944 A2 



Haroor Laboratory. Ccid Spring Haroor. NY. 1982. Gligonucieotices were synthesized on an Applied 
Biosystems model 38CA DMA synthesizer and purified by poiyacrytamide gei electrophoresis on denaturing 
gels. 

5 

Examole 1 



w Production of Tissue Plasminogen Activator 



Tissue plasminogen activator (t-PA) cDNAs have been described in the literature. See, for example, 
Pennica et al. (ibid.). Kaufman et al. ( Mot. Cell. Biol. 5:1750-1759, 1985) and Verheijen et al. (EMBO J. 

is 5:3525-3530. 1986). A cDNA clone comprising the coding sequence for mature t-PA was constructed in the 
inventors' laboratory using as starting material mRNA from the Bowes melanoma ceil line (Rijken and 
Coilen, J. Biol. Chem. 256:7035-7041, 1981). This cDNA was then used to construct the plasmid pDR1296. 
E. coli strain JM83 transformed with pDRl296 has been deposited with American Type Culture Collection 
undiT" accession number 53347. 

20 The cDNA in pDR1296 was extended to include the coding sequence for the pre-pro peptide of t-PA by 
joining it to a fragment constructed from synthesized oligonucleotides. The synthesized pre-pro fragment 
was inserted into Bam Hl-aigested pUC8. Plasmid plCl9R (Marsh et al., Gene 32:481-486, 1984) was 
digested with Sma I and Hind III. The on region of SV40 from map position 270 (Pvu II) to position 5171 
(Hind ill) was then ligated to the linearized plCl9R to produce plasmid Zem67. This plasmid was then 

25 cleaved with Bgl II and the terminator region from the human growth hormone gene (De Noto et al., Nuc. 
Acids Res. 9:3719-3730, 1981) was inserted as a Bgl ll-Bam HI fragment to produce plasmid Zem86 (Figure 
1). The synthesized t-PA pre-pro sequence was removed from the pUC8 vector by digestion with Bam HI 
and Xho II. This fragment was inserted into Bgl Il-digested Zem36 to produce plasmid Zem88. Plasmid 
pDR1296 was digested with Bgl II and Bam HI and the t-PA cDNA fragment was isolated and inserted into 

30 Bgl If-cut Zem88. The resultant piasmid was designated Zem94. 

The vector Zem99. comprising the MT-1 promoter, complete t-PA coding sequence, and the human 
growth hormone (hGH) terminator, was then assembled as shown in Figure 1. A Kpn l-Bam HI fragment 
comprising the MT-1 promoter was isolated from MThGH111 (Palmiter et aL, Science 222:809-814, 1983) 
and inserted into pUC18 to construct Zem93. Plasmid EV142, comprising MT-1 and hGH sequences in the 

35 pBR322 derivative pBX322 (Palmiter et a!., ibid.), was digested with Eco Rl and the fragment comprising 
the MT-1 promoter and nGH terminator sequences was isolated. This fragment was cloned into Eco Rl- 
digested pUC13 to construct oiasmid Zem4. Zem93 was then linearized by digestion with Bam HI and Sal I. 
Zem4 was digested with Bgl II and Sal I and the hGH terminator was purified. The t-PA pre-pro sequence 
was removed from the pUC9 vector as a Sau 3A fragment. The three DNA fragments were then joined and 

-o a plasmid having the structure of Zem97 (Figure 2) was selected. Zem97 was cut with Bgl II and the Bgl ll- 
Bam HI t-PA fragment frcm oDR1296 was inserted. The resultant vector was designated Zem99. 

Plasmid pSV2-DHFR •SuDramani et al., ibid ) was digested with Cfo I, and the fragment containing the 
DHFR cDNA and the 3 attached SV40 sequences was isolated, repaired, and ligated to Bam HI linkers. 
After digestion with Bam HI, an approximately 800 bp fragment containing the entire cDNA and the SV40 

J5 terminator region was purified and ligated to Bam Hl-digested pUC8. As shown in Figure 2. Zem67 was 
digested with Bgl II anc ligated with the Bam HI DHFR-SV40 fragment to generate plasmid Zeml76. 
Plasmid Zem93 was digested with Sst I and re-iigated to generate plasmid Zem106, in which approximately 
600 bp of sequence 5 to the MT-1 promoter was eliminated. Plasmid Zeml06 was digested with Eco Rl 
and ligated to the Eco Rl fragment containing the DHFR gene from piasmid Zem176. The resulting plasmid 

so was designated Zts13. Plasmid Ztsl3 was digested with Bam HI and ligated to the Bam HI fragment from 
plasmid Zem99 containing the entire t-PA coding region and hGH terminator sequence. The resulting 
plasmid was designated Zts15. Zts15 was partially digested with Bam HI, repaired and re-!igated to 
generate plasmid Zem219, in which the 3' Bam Hi site was destroyed (Figure 2). Zem219 was then partially 
digested with Xba I, repaired and religated to generate plasmid Zem219a, in which the 3 Xba I site was 

55 destroyed (Figure 5). 

A DNA sequence encoding aprotinin was then inserted into Zem219a in place of the sequence 
encoding mature t-PA. Zsm219a was digested with Bgl II and Xba I. and the vector fragment was 
recovered. Plasmid pKFN-414, a pUC-based plasmid containing a synthetic aprotinin gene, was digested 
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*• "~ - .-a il arc Xca !. arc *"e ~ : -zO ::c acrctmn ^agment was recovered. . ; ~e seccence cf :he synthetic 
\z:z\.r\r. ~ere .s snc.vn r ->gure "5 .■ ^^;3 ; -agment :acks tne 5 enc cf ;ne acrc<:nn ceding secuence. 
Ol.g-ruc-eoiides ZC1903 <se-se srarc, 5 3AT CTA 3GC CTG ATT TCT GTT TGG AAC CTC CAT ACA 
C" 2 anc ZC.909 .'ar.r-se^se. 5 3 AC CAG TGT A" GAG GTT CCA AAC AG A AAT CAG GCC TA 3') 
■*. t'v ar.reaied tc crevice t~e 5 en;; : ; :.~e aprotimn seauen.ee ana a Sgi il s;:e to \z\r\ "ne aprotimn 
s;:-5rca ;n frame .v::h :re :-P- r,'c~orc secuence. Tne -150 op aDrc;in.p. ^agment. the annealed 
: ; nucleotides anc tne Zem.2!9a Gagmen: -sre men jcinec tc construct Zem252. 

P'asmias Zem2i9a anc Zerr.252 were cctransfectea into tk"BHK cei;s by the caicium phosphate 
crrcacure. Transfected ceils were selected -vith methotrexate and screened for croauction of t-PA. Positive 
:;cnes were raaiolabelied anc ce tera rrecia were screened f cr aprctinin crcducticn using a rabbit anti- 
aprotmin antibody. Aprotinin-prcducmg c:cnes were selected. 

Example 2 



^reduction of Plasminogen 



A. Cloning cf Alpha- 1 -Antitrypsin cDNA 

A cDNA coding for the predominant form of human a-1 -antitrypsin (AAT) was isolated from a human 
i'ver cDNA library by conventional crceeaures using the baboon sequence (Kurachi et al., Proc. Natl. Acad. 
Sci. USA 78:6826-6830. T980; and C^anora et al., Biochem. Biophys. Res. Comm. 103:751-758, 1981) as a 
DNA hybridization probe. The hbrary .vas constructed by inserting human iiver cDNA into the Pst I site of 
the plasmid pBR322 (Bolivar et al.. Gene 2:95-1 1 3, 1977). The AAT cDNA was isolated from the library as a 
1500 base pair (bp; Pst I fragment. T^is fragment was inserted into the Pst I site of pUCl3 to produce the 
c:asmid pUCal. In pUCal the AAT sequence is flanked on the 3 end by Xba I and Eco Rl sites in the 
colylinker. This cDNA sequence -vas used to construct the plasmid PFATPOT, illustrated in Figure 4. 
Plasmid PFATPOT has been deposited w-th ATCC as a Saccharomyces cerevisiae strain E18 transformant, 
accession number 20699. 

The AAT cDNA was then joined to the TPI1 (those phosphate isomerase gene) terminator in the 
plasmid pMVRl. This plasmid :urtner comprises the TPI1 promoter and was assembled in the following 
manner. Plasmid plC7 (Marsh et ai.. Gene 32:481-486, 1984) was digested with Eco Rl, the fragment ends 
*ere blunted with DNA polymerase I (Kienow fragment), and the linear DNA was recircutarized using T4 
DNA jgase. The resulting plasmid was used to transform E. coli RR1 . Plasmid DNA was prepared from the 
t-ansformants and screenea fcr tne loss of the Eco Rl site"~A~plasmid having the correct restriction pattern 
^as designated plC7RT. The TPM promoter fragment was obtained from plasmid 0TPIC1O (Alber and 
Kawasaki, J. Molec. Appl. Genet, i. 419-434, 1982) as illustrated in Figure 4. This plasmid was cut at the 
umcue Kon I site within the TPn ~ene. The TPM coding region was removed with Bal31 exonuciease, and 
an Eco Rl linker (sequence: GG.AATTCC) was added to the 3' end of the promoter. Digestion with Bgl II and 
Ecc PI yielded a TPM promoter fragment having Bgl II and Eco Rl sticky ends. This fragment was then 
,c:ned to plasmid YRp7' (StinchcomD et ai., Nature 282:3943, 1979) which had been cut with Bgl II and Eco 
R!. The resulting plasmid, TE32, was cleaved with Eco Rl and Bam Hi to remove a portion of the 
tetracycline resistance gene. The linearized plasmid was then recircularized by the addition of the 
creviousiy described Eco R!-Bam Ht linker to produce plasmid TEA32. Plasmia TEA32 was digested with 
Bgl II and Eco Rl and the approximately 900 bp partial TPM promoter fragment was gel purified. Plasmid 
piCi9H (Marsh et ai., ibid.) was cut with Bgl II and Eco Rl and the vector fragment was gel purified. The 
TPM promoter fragment was then ligated to the linearized plCl9H and the mixture was used to transform E. 
coii RR1. Plasmid DNA was prepared and screened for the presence of an approximately 900 bp Bgl ll-Eco 
RHragment A correct plasmid was selected and designated pICTPiP. Plasmid plC7Rr was digested with 
H ! nd 111 and Nar I and the 2500 bp fragment was gel purified. The partial TPM oromoter fragment fca. 900 
bet *as removed frcm pICTPIP using Nar I and Sph I and was gel purified. oFATPOT was digested with 
Son I and Hind 111 and the 1750 bp fragment comprising a portion of the TPM oromoter, the AAT cDNA, and 
the ~PM terminator was gel purified. The piC7RT fragment, the TPM oromoter fragment, anc the TPM 
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promoter-AAT- TPM terminator fragment ;rcm pFATPOT were then ccmcined in a triple ligation to nroduce 
pMVR1 (Figure 4). 



5 B. Construction of Aipha-1 -Antiiryosin Expression Vector and Transfection of BHK Cells 



For expression and secretion of alpha- 1 -antitrypsin by transfected mammaiian cells, the Bam Hi to Xba 
i fragment from plasmid pMVFU, containing the entire coding region for AAT from amino acid number 2. 
w was isolated and inserted into Bam HI, Xba I digested Zem219a (Example 1). 

To construct the AAT expression vector, Zem219a was digested with Bgl II and Xba I and the t-PA 
coding sequence was removed. Oligonucleotides ZC1173 ( s ' GAT CTT CA 3 ') and ZC1174 ( 5 ' GAT CTG 
AA 3 ') were annealed to form a Bgl ll-Xho II adaptor. The AAT fragment, adaptor and linearized vector were 
then joined in a three-part ligation to construct Zem235 (Figure 5). The adaptor correctly aligned the reading 
15 frames of the t-PA pre-pro and AAT sequences and restored the glu codon which was missing from the 5* 
end of the AAT sequence. 

Plasmid Zem235 was transfected into tk~ BHK ceils by electroporation. Methotrexate selection was 
applied after 48 hours, and after two weeks a number of clones were picked, expanded and characterized 
for levels of alpha- 1 -antitrypsin secreted into the culture media. One clone, which secreted AAT at a rate of 
20 about 20 ug/ml/day, was selected and designated 235-6. 



C. Construction of Plasminogen Expression Vector and Transfection of Cells 



25 

A cDNA encoding plasminogen was obtained from Dr. Mark Markson of the University of Washington, 
Seattle. Washington. The clone had been isolated from a lambda phage library screened with a partial 
cDNA clone described by Malinowski et al. (ibid.). The sequence of the cDNA is shown in Figure 6. 

Lambda phage DNA was prepared from the positive clone according to conventional procedures. Phage 
30 DNA was subjected to a partial Eco Rl digestion and an approximately 2800 bp Eco Rl fragment containing 
the entire ceding region was recovered and cloned into the Eco Ri site of pUC19 to construct plasmid 
pUC19-Plg. 

The 183 bp Bal l-Eco Ri fragment containing the 5 end of the coding sequence was isolated from 
pUC19-Plg and cloned into Sma I. Eco Rl-digested pUCl8 in order to add a Bam HI site to the 5 end. The 

35 resultant plasmid was digested with Bam Hi and Eco Ri and the 190 bp fragment was isolated. 

Plasmid pUCl9-Plg was digested with Eco Ri and Eco RV and the fragment containing the middle 
region of the cDNA was isolated. 

To obtain the 3 portion of the plasminogen cDNA, a fragment beginning at the codon for amino acid 
541 was subcloned in pUC118. The resulting plasmid was digested with Eco RV and Xba I and the 

-o approximately 660 bp 3 plasminogen fragment was isolated. 

The three plasminogen cDNA fragments (Bam Hl-Eco Rl, Eco Rl-Eco RV ana Eco RV-Xba I) were 
combined with Bam HI, Xba I digested Zem2l9b for ligation to generate plasmid 219b-Plg. (Zem2i9b was 
derived from Zem219a by digesting that vector with Bam HI and Xba I, removing the t-PA cDNA 
sequences, and ligatmg the vector fragment with a Bam Hl-Xba I adaptor.) The Bam HI plasminogen 

45 fragment was then removed from Zem2l9b-Pig and inserted into Bam Hl-aigestea Zem228. To construct 
Zem228, Zem67 was digested with Hind 111 and Bgl II and the Hind Ill-Bam HI neo + SV40 terminator 
fragment from pSV2neo (Southern and Berg, J. Mol. AppL Genet. 1:327-341, 1982) was inserted. The 
resultant vector was designated Zem220. P1asmid"Zem93 was digested~with Sst I to remove upstream MT-1 
sequences and the vector was recircuiarized to- construct Zem106. Zem220 was digested with Eco Rl and 

50 the fragment containing the expression unit of SV40 promoter-neo-SV40 terminator was recovered and 
joined to Eco Rl-digested Zeml06. The resultant vector, designated Zem223, contained the SV40 and MT-1 
promoters in opposite orientation. Zem223 was digested with Bam HI and a 237 bp Bel I- Bam HI SV40 
terminator fragment was inserted. The resultant plasmid was designated Zem228 (Figure 3). The Zem228- 
derived plasminogen expression vector was designated Zem228-Plg. 

55 Zem228-Plg was transfected into the ATT-expressing cell line 235-6 by the electroporation procedure, 

essentially as described by Neumann (ibid.). Colonies were selected and assayed for plasminogen 
production by enzyme-linked immunosoroent assay (ELISA). The assay used a goat polyclonal antiserum to 
human plasminogen (American Diagnostica) as the capture antibody. Immunoreactive material was detected 
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. ^rrs : ; c;*;r,'a:ec 'cccit cc ,c:opa. ant;cccy against ^uman c asmrcger arc avicin-ccnjugatec 
:;^erac:£" ce r c/icase. 

~^.n cosi::ve cxnes were cut into six-weii tissue culture aisnes. When the ceils were 30-90% confluent, 
r, .ve r e cut into cvst£ine-ce : !C:en! mecium :cr one hcur. Fifty — icrccuries cf :E S-eys:e:ne was adced to 
a;- -veil 1 m; culture vo^me weil) the cehs .vere mcubatea overnight, ana :ne -r.ecfa were harvested for 

The :e!is .vere ,v as r.ea .vitn cc;c P3S, anc an extract was precarec :o assay : c r cytopiasmic 
asmmcgen. The csiis were susoended in 1 mi RIP A buffer (10 mM Tris pH 7.4, 1% ceoxycholate, 1% 
'itcn X-100. 0.1 "o SDS, 5 nM EDTA, 0.7 M NaCI). The lysates were freeze-thawed two times on dry ice 
rc centrifugea :n the ccid for 15 minutes at 10.000 rpm. The supernatants were then used for 
idioimmune precipitation. 

Media and cell extract samples were assayed for plasminogen by radioimmune precipitation. Samples 
n a /otume of 0.1 to 1.0 ml) were combined with 5-10 ul of preimmune serum, incubated on ice for 1 hour 
ra 'hen mixed with 50-100 uJ cf Staphylococcus aureus (Pansorbin. Sigma Chemical Co.. St. Louis, MO) 
o-c mcubated ; or 1 hour on ice. After centnfugation, the supernatants were mixed with 5 ul of rabbit 
ciycicnai antiserum to human plasminogen (Boehringer-Mannheim) and incubated on ice for one hour, 
ifty u! of Staphylococcus aureus was added and the mixture was incubated one hour on ice. The cells 
ere pelleted anc the pellets were washed with one ml PBS containing 0.5% NP-40. 0.1% SDS. The 
asned celts were ceileted, resuspended in 50 ul PBS plus 50 ul 2X loading buffer (0.1 M Tris pH 6.8, 
5% glycerol. 3.2% SDS, 3% i-mercaptoethanoi, 0.002% bromohenoi blue), boiled 5 minutes and 
'■ecrccncresea cn a 10% coiyacryiamiae gei. Proteins were visualized oy silver staining ana autoradiog- 
raphy. 

Assay results showed that AAT-producmg cells secreted full-length plasminogen into the culture media. 
^ contrast, control BHK cells (i.e., cells transfected with 219b-Plg but not Zem235) did not secrete 
electable amounts of plasminogen and contained degraded plasminogen in the cytoplasm (Figure 7). 

Example 3 



Production of Protein C 



A. Protein C cDNA Cloning 



A cDNA ccamg for a cordon of human protein C was prepared as described by Foster and Davie (ibid.). 
ie : 'y. a .vgtl 1 cDNA library was prepared from human liver mRNA by conventional methods. Clones were 
rreered using '—labeled affinity-purified antibody to human protein C, and phage were prepared from 
:s:tive clones by the olate lysate method (Maniatis et aL, ibid.), followed by banding on a cesium chloride 
-ac;ent. The cDNA inserts were removed using Eco Rl and subcloned into ptasmid pUC9 fVieira and 
-lessing, Gene J_9 ; 259-268, 1982). Restriction fragments were subcloned in the phage vectors M13mp10 
r.a M!3mo1i (Messing, Meth. in Enzymoiogy 101:20-77. 1983) and sequenced by the dideoxy method 
Sanger et al.. Proc. Natl. Acad. Sci. USA 74:5463-5467, 1977). A clone was selected which contained DNA 
crresponding to the known partial sequence of human protein C (Kisiei. J. Clin. Invest. 64:761-769, 1979) 
nd encoded protein C beginning at amino acid 64 of the light chain and extending througJTthe heavy chain 
nd nto the 3 non-ceding region. This clone was designated \HC1375. A second cDNA clone coding for 
rotem C from amino acid 24 was identified. The insert from this clone was subcloned into pUC9 and the 
.lasmia cesignatea oHCX6L. This clone encodes a major portion of protein C, including the heavy chain 
:oaing region, termination codon, and 3 non-coding region. 

The cDNA insert from XHC1375 was nick translated using a- 32 P dNTP's and used to probe a human 
genomic library in phage XCharon 4A (Maniatis et al., Ceil 15:687-702, 1978) using the plaque hybridization 
crccedure of Benton and Davis ( Science 196 :181-182, 1977) as modified by Woo (Meth. in Enzymoiogy 
-3:381-395. 1979). Positive clones were isolated and plaque-purified (Foster et al., Proc. Natl. Acad. Sci. 
USA 82:4673-4677. 1985, herein incorporated by reference). Phage DNA prepared from positive clones 
Si. navy" et al.. in Experiments with Gene Fusion, Cold Spring Harbor Laboratory, 1984) was digested with 
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Ecc FA or Bgi il arc tne genomic :nserts .vers purified and 5ucc:oned in oUC9. insert restriction fragments 
were suocioned into Ml 3 vectors and sequer.cea to confirnn their identity and estaotisn the ONA sequence 
of the entire gene. 

A genomic fragment containing an exon corresponding to ammo acids -42 to -19 of the ore-pro peptide 
5 of orctein C was isolated, nick translated, and usea as a probe to screen a cDNA library constructed by the 
tecnnicue of Gubler and Hoffman ( Gene 25:263-269, 1983) using mRNA from Hep G2 ceHs. This ceil line 
was derived from numan hepatocytes and was previously shown to synthesize protein C rFair anc Bahnak, 
BiooG 64:194-204. 1984). Ten positive clones comprising cDNA inserted into the Eco Rl site of phage Xgt1 1 
were isolated and screened with an oligonucleotide probe corresponding to the 5 non-coamg region of the 
t c protein C gene. One clone was also positive with this probe and its entire nucleotide sequence was 
determined. The cDNA contained^70 bp of 5 untranslated sequence, the entire coding sequence for human 
prepro-protein C. and the entire 3 non-coding region corresponding to the second polyadenylation site. 

"5 B. Construction of Vector pD5 

The vector pD5 was derived from pDHFRIIi as shown in Figure 8. The Pst I site immediately upstream 
from the DHFR seauence in pDHFRIIi fBerkner and Sharp, Nuc. Acids Res 13:841-857. 1985) was 

20 converted to a Bam HI site by digesting 10 ug of plasmid with 5 units of Pst I for To minutes at 37* C in 
100 u\ buffer A (10 mM Tris pH 8, 10 mM MgCl2, 6 mM Nad, 7 mM £-MSH). The DNA was phenol 
extracted, EtOH precipitated, and resuspended in 40 ul buffer B (50 mM Tris pH 8, 7 mM MgCi 2 , 7 mM £- 
MSH) containing 10 mM dCTP and 16 units T4 DNA polymerase and incubated at 12' C for 60 minutes. 
Following EtOH precipitation, the DNA was ligated to 2.5 ug kinased Bam HI linkers in 14 ul buffer C (10 

25 mM Tris pH 8, 10 mM MgCIa, 1 mM DTT, 1.4 mM ATP) containing 400 units T4 poiynucieotide ligase for 
12 hours at 12* C. Following phenol extraction and EtOH precipitation, the DNA was resuspended in 120 ul 
buffer D (75 mM KCI. 6 mM Tris pH 7.5. 10 mM MgCI 2 . 1 mM DTT), digested with 100 units Bam HI for 60 
minutes at 50* C, then electrophoresed through agarose. The 4.9 kb DNA fragment containing pBR322 and 
vector sequences (10 ug) was isolated from the gel, ligated in 10 ul buffer C containing 50 units T4 

30 polynucleotide ligase for 2 hours at 12* C, and used to transform E. coii HB101. Positive colonies were 
identified by rapid DNA preparation analysis, and plasmid DNA (designated pDHFR ) was prepared. 

Plasmid pDl was then generated by first cleaving pSV40 (comprising Bam HI digested SV40 DNA 
cloned into the Bam HI site of PML-1 [Lusky and Botchan, Nature 293:79-81, 1981]) (25 Leg) in 100 ul buffer 
D with 25 units Bel I for 60 minutes at 50* C, followed by the addition of 50 units of Bam HI and additional 

35 incubation at 37 *C for 60 minutes. Plasmid pDHFR' was linearized with Bam HI and treated with calf 
intestinal phospnatase. DNA fragments were resoived by agarose gel electrophoresis, and the 4.9 kb 
pDHFR' fragment and 0.2 kb SV40 fragment were isolated. These fragments (200 ng pDHFR' DNA and 100 
ng SV40 DNA) were incubated in 10 ul buffer C containing 100 units T4 polynucleotide ligase for 4 hours at 
12" C. and the resulting construct (pDl) was usea to transform E. coli RR1. 

40 As shown in Figure 8, plasmid pDl was modified by deleting the "poison" sequences in the pBR322 

region (Lusky and Botchan, ibid.). Plasmids pD1 (6.6 ug) and pML-1 Lusky and Botchan, ibid.) (4 ug) were 
incubated in 50 ul buffer A with 10 units each Eco Rl and Nru I for 2 hours at 37' C, followed by agarose 
gel electrophoresis. The 1.7 kb pD1 fragment and 1.8 kb PML-1 fragment were isolated and ligated together 
(50 ng each) in 20 ul buffer C containing 100 units T4 poiynucieotide Itgase for 2 hours at 12' C. followed 
by transformation into E. coli HB101. Colonies containing the desired construct (cesignated ppD1) were 
identified by rapid preoaration analysis. Ten ug of ppD1 was then digested with 20 units each Eco Rl and 
Bg! II in 50 ul buffer A for 2 hours at 37* C. The DNA was electrophoresed througn agarose, and the 
desired 2.8 kb fragment (fragment C) comprising the PML-1. 3' splice site and poly A sequences was 
isolated. 

so To generate the remaining fragments used in constructing pD5, pDHFRIIi was modified to convert the 

Sac II (Sst II) site into either a Hind III or Kpn I site. Ten ug pDHFRIIi were digested with 20 units Sst II for 
2 hours at 37' C, followed by phenol extraction and ethanol precipitation. Resuspended DNA was incubated 
in 100 ul buffer B containing 10 mM dCTP and 16 units T4 DNA polymerase for 60 minutes at 12 # C, 
phenol extracted, dialyzed, and ethanol precipitated. DNA (5 ug) was ligated with 50 ng kinased Hind III or 

55 Kpn I linkers in 20 ul buffer C containing 400 units T4 DNA ligase for 10 hours at 12* C, phenol extracted, 
and ethanol precipitated. After resuspension in 50 ul buffer A, the resultant plasmids were digested with 50 
units Hind lil or Kpn I, as appropriate, and electrophoresed through agarose. Gel-isolated DNA (250 ng) was 
ligated in 30 ul buffer C containing 400 units T4 DNA ligase for 4 hours at 12* C and usee to transform E. 
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- : T-e -esctant cashes .ve-s r-esignatec PuhFRlil 'H:nc '!;» arc cCHF-i.:; .K:^ li. A 0.4 ^p Ecc Ri- 

- "ragmen: -fragment A . .\as = o^n~;ea ircm pOHFRIll (Ken cy cigesren .v;tn E:o R! ana Kpr. i 
: : '-z-vec by agarose gei eiec:rccn.c r 35:5. 

~-e 374Q enharcer sec-e~ce .%as inserted 'ntc pDHFRNI fHinc ill; as .renews: 50 ^g SV-iO DNA was 
" "-:^5a m 120 puffer A w ; th 50 units Hi no ill for 2 hours at 37' C. ana the Hind it! C S74Q rragment 
5'"'-tC46 cpf -as ge- pur ; :ec. 3 -asmic pDHFRiil 'Hind 111) (10 _ig; .vas * r ea:ea -vtth 250 ~g can' ntestmai 
:-~:spnatase for 1 hour at 3 T C. c~er.o: exfactea arc ethanot crecipitatec. The meanzea piasm-; (50 ngj 
-as ..gated .vith 250 ng hind Hi C SV40 fragment in 16 ui buffer C for 3 hours at 12' C. using 2C/ units T4 
-O! /nucleotide ligase, and transfcrmea into E. coli HB1Q1. A 0.9 kb Kpn l-Bgi II 'ragment tfragrrv \ 3) was 
then soiated from this clasmia. 

For the final construction of pD5. fragments A ana B (50 ng each) and 10 ng fragment C wr . e Iigated 
.vtth 200 units T4 polynucleotide ligase for 4 hours at 12* C, and the mixture was transfected into E. coli 
PRi. Positive colonies were aetectea oy r apid preoaration analysis, and a large-scale preparation oT pD5 
F-gure 8) was mace. 



C. Construction of Expression Vector p594 



The expression of protein C cDNA was achieved in the vector pDX. This vector was derived from pDl 1 
ana cD5 . Plasmid pD5 is icenticai to oD5 except that the SV40 polyadenyiation signal (i.e.. the SV40 Bam 
HI '2533 bp] to Bel I [2770 bo] fragment) is in the late orientation. Thus, pD5' contains a Bam HI site as the 
site of gene insertion. Plasmid dD11 differs from pD5 in that the Hind III (5171 bp in the SV40 genome) to 
Kpn I (294 bp in SV40) fragment, containing enhancer sequences, is in the opposite orientation. 

To generate pDX, the Eco Rl site in dD11 was converted to a Bel I site by Eco Rl cleavage, incubation 
*ith S1 nuclease, and subsequent ligation with Bel I linkers. DNA was prepared from a positively identified 
:oicry, and the 1.9 kb Xho l-Pst I fragment containing the altered restriction site was prepared via agarose 
gei electrophoresis. In a seconc modification, Bel l-cieaved pD5' was Iigated with kinased Eco Rl-Bcl I 
aoactcrs (constructed from oligonucleotides ZC525, 5' GGAATTCT 3'; and ZC526. 5 GATCAGAATTCC 3') 
in : r cer to generate an Eco Rl site as the position for inserting a gene into the expression vector. Positive 
colonies were identified by restriction endonuclease analysis, and DNA from this was used to isolate a 2.3 
*<d Xho l-Pst I fragment containing the modified restriction site. The two above-aesenbed DNA fragments 
*ere ncubated together with T4 DNA ligase and transformed into E. coli HB101, and positive colonies were 
identified by restriction analysis. A preparation of such DNA, termecTpDX (Figure 9), was then made. This 
oiasm.id contains a unique Eco Rl site for insertion of foreign genes. 

T-e protein C cDNA was then inserted into pDX as an Eco Rl fragment. Recombinant plasmids were 
screened by restriction analysis to identify those having the protein C insert in the correct orientation with 
-aspect to the promoter elements, and plasmid DNA (designated pDXPC) was prepared from a correct 
: one. Because the cDNA insert in cDX^PC contains an ATG codon in the 5 non-coding region, deletion 
mutagenesis was performed on the oDNA prior to transfection and expression exoenments. Deletion of the 
:^ree oase pairs was performed accorcing to standard procedures of oiigonucieotide-directed mutagenesis. 
Tv "e cDX-based vector containing the modified cDNA was designated p594. 

D. Expression of Protein C in a KEX2 Transfected Cell Line 



T'^e Saccharomyces cerevisiae KEX2 gene was isolated from a yeast genomic iibrary by screening 
transformed kex2 mutant ceils for production of an a-factor halo on a lawn of suitable tester cells. One clone 
^as obtained which complemented ail reported defects of kex2 mutations (mating, a-factor production, 
maturation of killer toxin ana sporulation in a kex2 homozygous diploid strain). The gene was subcloned into 
a pUC vector under the control of the yeast GAU promoter. The resultant plasmid, designated pi 51 5. has 
been deposited with American Type Culture Collection as an E. coli HB101 transformant under accession 
-umber 67569. As shown in Figure 10, p1515 was digested~w7th" Hind 111 and a 2.1 kb fragment was 
^covered. This fragment was Iigated to Hind Ill-cut pUCl8 to construct plasmid pUCl8/KEX2. The KEX2 
'ragment (2.1 kb) was then isolated 'rom DUC13/KEX2 by digesting the plasmid partially with Hind II! and to 
completion with Bam HI. The remainder of the KEX2 sequence was then isolated as a 0.43 kb fragment 
from a Bam HI + Hind III digest of pi 51 5. The two KEX2 fragments were then Iigated into the Bam HI site 
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of the vectors Zem223 ard Iem229 ' r: gure i0). i'Zem223 is similar to Zerr.228 but contains a DHFR gene 
m place of trie necr^vcn -esisiar.ce gene. A cion;ng s;:e is 'tanked oy the MT-1 oromcter and SV40 
terminator.) The resulting piasmics were designated KEX2/Zem22S and KEX2/Zem229, respectively. 

The BHK ceil line tk~ts"3 //as co-transrected with the piasmics p594 and pSV2-DHFR by the calcium 

5 pnosphate procedure. Transfectec cans were selectea with 250 nM methotrexate (MTX) and clonal ceil lines 
were isolated. A clonal ceil !;ne wmcn secreted protein C into the culture medium at 1.5 pg. cell day was 
selected anc designated PC594-2C4 8HK. 

Ten ug of KEX2/Zenr223 was transiected into the PC594-204. BHK ceii line by the caictum phosphate 
procedure. Ceils were cultured m the presence of 250 nM MTX at all times. Clones were selected with 500 

w Ig/ml G418 and twelve clonal ceit lines were selected. 

The selected clones were puise-labeled with 2S S-cysteine in cysteine-free MEM (Gibco) containing 1% 
fetal calf serum for 24 hours. The culture media were collected and examined for the presence of single- 
chain and cleaved two-chain protein C by immunoprecipitation with a monoclonal antibody to protein C. 250 
ul of media was combined with 10 ug of the antibody and the mixture was incubated at 37* C for one hour. 

75 100 ul of Staph A cell suspension (Pharmacia, Piscataway, NJ) was added and incubation was continued at 
37' C for one hour. The ceils were oeileted by centrifugation and the pellet was resuspended in TBS. The 
cells were again pelleted and the pellet was resuspended in 60 ul of gel buffer containing 1% tf- 
mercaptoethanol. The suspension was heated to 100° C for three minutes, then electrophoresed on an SDS- 
polyacrylamide gel. Prcieins were visualized by autoradiography. The parent cell line, PC594-204/BHK. 

20 showed approximately 70% of the protein C in the one-chain form, with the remaining 30% in the two-chain 
form. One of the G41 8-selectea KEX2 cell lines, designated PC594-204.KEX2-1 , produced 95% two-chain 
protein C, with the remaining 5% m the one-chain form. 

25 E. Modification of the Protein C Processing Site 



To enhance the processing of single-chain protein C to the two-chain form, two additional argintne 
residues were introducec into the protein, resulting in a cleavage site consisting of four basic amino acids. 

30 The resultant mutant precursor of protein C was designated PC962. It contains the sequence Ser-His-Leu- 
Arg-Arg-Lys-Arg-Asp at tne cleavage site between the light and heavy chains (Table 3). Processing at the 
Arg-Asp bond results m a two-chain protein C molecule. 

The mutant moiecu.e was generated by altering the cioned cDNA by site-specific mutagenesis 
(essentially as described by Zciier and Smith, DNA 3:479-488, 1984) using the mutagenic oligonucleotide 

35 ZC962 (5' AGT CAC CTG AGA AGA AAA. CGA~GAC A 3'). Plasmid p594 was digested with Sst I. the 
approximately 87 bp fragment ^as c'oned into Ml3mp11, and single-stranded template DNA was isolated. 
Following mutagenesis, a correct clone was identified by sequencing. Replicative form DNA was isolated 
and digested with Sst I tc sciate the mutagenized fragment, which was joined with Sst l-cut p594 in a two 
part ligation. Clones ra.'sng t^e Sst ! fragment inserted in the desired orientation were identified by 

-£0 restriction enzyme macc.rg. T'~e resulting expression vector was designated pDX/PC962. 

Plasmid pDX/PC962 -vas cc -transfected into tk" ts13 BHK cells with 0SV2-DHFR (Subramani et al., 
Mol. Ceil. Biol. 1:854-864 1981) by the calcium phosphate procedure (essentially as described by Graham 
and van der Eb~ ibid.). T-e transfected cells were grown m Dulbecco's modified Eagle's medium (MEM) 
containing 10% fetat caif serum i x PSN antibiotic mix (Gibco 600-5640). 2.0 mM L-glutamine and vitamin K 

45 (5 ug/mt). The cells were selected in 250 nM methotrexate (MTX) for 14 days, and the resulting colonies 
were screened by the immunctnter assay (McCracken and Brown, BioTechniques. 82-87, March/ April 1984). 
Plates were rinsed with PBS or No Serum medium (Dulbecco's plus penicillin-streptomycin, 5 ug/mi vitamin 
K). Teflon® mesh (Spectrum Medical Industries, Los Angeles, CA) was then placed over the cells. 
Nitrocellulose filters were wetted with PBS or No Serum medium, as appropriate, and placed over the 

so mesh. After four hours incubation at 37* C, filters were removed and placed in 50 mM Tris pH 7.4, 5 mM 
EDTA, 0.05% NP-40, 150 mM NaCI, 0.25% gelatin for 30 minutes at room temperature. The filters were 
incubated for 1 hour at room temperature, with shaking, in biotin-labeled sheep polyclonal antibody to 
protein C, 1 ug/ml in the same buffer. Filters were then washed in the buffer and incubated 1 hour at room 
temperature, with shaking, in avidin-conjugated horseradish peroxidase (Boehringer-Mannheim), 1:1000 in 

55 the same buffer. Filters were washed in 50 mM Tris-HCI. pH 7.4, 5 mM EDTA, 1 mM NaCI, 0.25% gelatin, 
0.4% Sarkosyl, 0.05% NP40, then in H 2 0, and incubated in color reagent (60 mg HRP color development 
reagent [Bio-Rad], 20 ml methanol, 100 ul H2O2 in 100 ml 50 mM Tris pH 7.4, 150 mM NaCI). The reaction 
was stopped by transferring the filters to H 2 0. Six of the most intensely reacting colonies were picked by 
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zicn.nc arc crc.vr rc:/.cua:lv ;r 1 0 im c lates. 
~-::em C crcccttn eveis 'rem "ear v ccr ; ;uen: zuitures -vers measurec ov enzyme-. ^<ec im- 
~"'-r:3crceni assay : EL!SA> Affinity cur.nea zoiyclcrat cr heavy znain-soec: ; ic mcrcccrai ar*i!cccy :o 
"-ran crotem C MCO j.! of a 1 ml solution n 0.1 M NaeCGi. cH 9.5). was addea tc each wei 1 cf 95 .veil 
~~ ■ t: ;n r .er plates ana the orates .vere mc-careo overngnt at 4 C. The wens /vere tr.e^ wasrec "Tree lines 
■■ 5 ~M cn:sc~a:6 Cu ;f er. or- " 5 0.15 M NaC) containing 0.05% T /-een-20 and men ;rcucated 

'CO j.: of 1 % bovine serum aibumin. 0.05% T<veen-20 in P3S at 4 C evermore The oiates -vere men 
nsed several times witn PBS, air cried, anc stored at 4' C. To assay the samples, 100 U of eacn samp;e 
••as mcjoatea fcr 1 hour at 3T C -n the coated wetls ana the .veils were nnsea with 0.05% T.veen-20 :n 
PES Tine plates were men incubated for 1 hour at 37 *C with a biotin-conjugated. affinity-purirec. sheep 
colyclonal antibody to protein C {30 ng/mi) in PBS containing 1 % bovine serum aibumin ana 0.05% Tween- 
20 The wells were nnsea with PBS ana incubated again for 1 hour at 37* C with avidin-ccnjugated alkaline 
z^csohatase m PBS containing 1% bovine serum albumin and 0.05% Tween-20. The wells were then 
■ ~sea .vi th PBS. and alkaline chosphatase activity was measured by the addition cf 100 - I of chosDhatase 
sucstrate 'Sigma 104; 600 ^g ml) n *Q% diethanolamine, pH 9.8. containing 0.3 mM MgCoj 2 . The 
acscrcance at 405 nm was read on a microtiter plate reader. Resuits are given in Tabie 1. 

TABLE 1 



Clone 


Cell NumDer 


ELISA 


pg cell/day 




fx =0~ 7 ) 


ng.ml 




962-1 


1.1 


2500 


2.20 


-2 


0.8 


1250 


1.56 


-3 


12 


1350 


1.12 


-4 


1.2 


550 


0,46 


-5 


1.2 


1550 


1 30 


-6 


1 .2 


950 


0.80 



~>e clone BHK/962-1 was grown in larger scale culture, and several hundred micrograms of crotem C were 
curded by affinity cnromatography on a column prepared by coupling 7 mg of polyclonal sheen antibody 
against numan protein C to 2 grams of CNBr-activated Sepharcse 4B (Pharmacia Inc.. Piscataway, NJ). Ceil 
culture medium was applied to the column, the column was washed with 100 ml TBS (50 mM Tris, pH 7.5. 
150 mM NaCI). and the protein C was eluted with TBS containing 3 M KSCN or with pH 11.5 buffer (25 mM 
zc'assium phosphate. pH 115, 0.2 M NaCI. 2% Tween 30. 0.5% NaN^). Western btct analysis dem- 
:nstrated that the mutant protein C was approximately 95% in the two-chain form, comparea to about 20% 
:.vc-chain orotetn C cotained from BHK ceils transfected with the native sequence. 

Milligram quantities of crctein C were purified from either stable BHK cell clones exoressing :he PC962 
~-u:ant protein or stable 293 cei 1 clones e<pressing the wild-type protein C (p594 transferee cedsj using a 
~~crocional antibody column specific for the calcium-induced conformation of protein C. Cell culture media 
•-ere applied to the column in the presence of 5 mM CaCb. Protein C was eluted from the column with TBS 
regaining 10 mM EDTA. The use of this purification method permitted purification of completely active 
cotem C without exposure to cenatunng conditions. The purified protein C was analyzed by SDS/PAGE 
Allowed by silver staining and was shown to be >95% pure. 

The BHK-prcauced PC962 protein was assayed for its ability to be activated to a form which shows 
zcth amidolytic and anticoagulant activities. Affinity-purified protein samples were exhaustively diaiyzed 
against TBS then activated by incubation at 37° C for 1 hour with 0.1 volume of 1 unit/ml Protac C 
American Diagnostica). Amidolytic activity was measured by adding aliquots of the activation mixture to 
*00 u.l of 1 mM protein C substrate (Spectrozyme PCa, American Diagnostica) in a microtiter well and 
measuring the change in At C 5 over time using a microtiter plate reader. Anticoagulant activity of the 
activated protein C was assayed as described by Sugo et al. (ibid.). The affinity-purified PC962 protein was 
demonstrated to be fully active in both amidolytic and anticoagulant assays. Elution from the antibody 
column with pH 11.5 buffer was shown to yield a protein with higher activity than that obtained using 3 M 
KSCN elution. 

Clonal ceil lines from the DDX/PC962 transfection into BHK cells were isolated by a process of limiting 
:. ut.on. One plate of MTX-selected colonies (approximately 300 colonies) was trypsinizeti. counted, and re- 
lated into microtiter wells at an average of 0.5 cell/well. These were grown up in selective media containing 
ISO nM MTX. About 50% of me wells contained colonies. Wells containing identifiable colonies fi-2 mm 
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diameter) were assay ec :v ELiSA for protein C level in the media. For 'his assay, fresh medium was added 
to ail the weils, [he oiates were :ncuoated f or 75 minutes, then the meaium was removed and assayed. Five 
colonies which gave 75-m;nute accumu.ations of greater than 50 ng/mt (corresponding to over 1000 
ng ml day) were scitt into 10-cm oiates tor larger scaie culture. Protein C production ievels for these clones 

5 ranged from 1.1 to 2.3 pg.ceil.cay. 

A second piasmid, designated PC229 962, was constructed by inserting the PC962 cDNA into piasmid 
Zem229. An Eco FU fragment containing the PC962 cDNA from oDXPC962 was ligated, with Eco Rl-Bam 
H! synthetic oligonucleotide adapters, to Zem229, which had been cut with Bam HI and treated with 
phosphatase. The resulting vector is PC229/962. 

ro Piasmid PC229/962 was transfected into BHK cells by the calcium phosphate method. Celts were 
cultured in Dulbecco's MEM containing 10% fetal calf serum and 5 ug/ml vitamin K. The 48-hour transient 
expression level from this transfection was approximately 25 ng/ml. After 2 days, the transfected cells were 
split into selective media containing 250 nM MTX and cultured for an additional 14 days. Three pfates from 
this transfection, containing approximately 200 colonies each, were screened by the immunofilter assay, 

is and the 24 most intensely reacting colonies were picked by cylinder cloning. These were grown individually 
m 10-cm plates, and their protein C production levels were measured. Colonies producing between 1.1 and 
2.3 pg/ceil/day were used for the production of stable, protein C-producing cell lines. 

Expression vector pDX/PC962 and piasmid pKO-neo were co-transfected by the calcium phosphate 
method into 293 cells. Transfected cells were split into media containing 500 ug/ml G418 after 48 hours. 

20 After 10 days in selective media, immunofilter assays were done, and two clones were picked by cylinder 
cloning. Protein C production was found to range from 1 to 2 pg/cell/day. The cultures were scaled up, and 
protein C was purified by immuno-affinity chromatography. Greater than 95% of the protein C was found to 
be in the two-chain form. 

The structure of the 962 mutant protein prepared from BHK and 293 cells was compared to that of wiid- 

25 type protein C from 293 ceils and from plasma. Analysis by SDS/PAGE followed by silver staining showed 
that all the recombinant proteins contained heavy and light chains which co-migrated with those of the 
plasma protein. The wild-type protein C synthesized in 293 cells contained a significant amount 
(approximately 20%) of single-chain, unprocessed protein of Mr = 66,000, whereas the mutant protein 
produced in either cell type was essentially completely processed to two chains. N-terminal sequence 

3G analysis snowed that both the light and heavy chains of the recombinant wild-type and BHK/PC962 mutant 
proteins were properly processed. The extent of gamma carboxylation of the recombinant proteins was 
measured by two distinct ELISA systems. The first system recognizes both gamma-carboxyiated and non- 
carboxylated forms of the protein, while the second utilizes specific antibodies which only recognize protein 
C which has undergone a gla-mduced conformational change in the presence of calcium. Analysis indicated 

35 that approximately 60% of the recombinant protein C produced in BHK cells and 90%-95% of that 
produced in 293 cells was sufficiently gamma carboxyiated to be recognized by the specific antibodies. 

The three recombinant orotems were also analyzed for amidolytic and anticoagulant activity and the 
results were compared to the activity of plasma protein C. PC962 from BHK cells and wild-type protein C 
from 293 cells both snowed full amiaolytic activity. In the anticoagulant assay, protein C from BHK cells had 

■io essentially the same soecific activity as plasma protein C, whereas both wild-type and PC962 mutant 
proteins from 293 cells consistently exhibited approximately 25% greater specific activity. One unit of 
protein C activity is defined as the amount in 1 ml of normal human plasma, which contains 4 ug of protein 
C per 1 ml (Gardiner and Griffin, Prog. Hematol. 13:265-278, 1983) specific activity = 250 units-mg). Wild- 
type protein C produced m 293 ceils was consistently in excess of 300 units/mg. 

F. Expression of Activated Protein C 



so The cDNA sequence for protein C was altered by site-specific mutagenesis to delete the portion 

encoding the activation peptide. The altered sequence was then transfected into BHK and 293 cells and 
stably transfected cells were selected. Active protein C was detected in culture media samples from both 
cell lines. 

To delete the activation peptide coding sequence, piasmid p594 was digested with Sst I and the -880 
55 bp fragment was purified and inserted into the Sst i site of Ml3mpl0 (Messing, Methods Enzymol. 101 :20- 
77, 1983). The 12 activation peptide cocons were deleted by oligonucleotide-directea deletion mutagenesis 
(Zoller and Smith, DNA 3:479-488. 1984) using the mutagenic oligonucleotide ZC829 {5 CTG AAA CGA 
CTC ATT GAT 3 ). Replicative form DNA was prepared from mutant phage clones and digested with Sst I. 
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:: C5"".os .vere s: r eerec : zr z r zzer centatxn of the Ss; . tragrr.ert oy restriction nraccmg u5:n: 5c. n. A 
:c-ec: oiasmia was se.ectec arc designated cPCS29. P'asmic pPC329 -as secuencec to /enfy the 

■6 r C9 01 "he OeSireC COCiPQ 5 9C LOT. 09 

asnia oPC329 .vas co-t'ans;ectsc ~tc tk~5HK ceils <wir o;asmtd PSVDHr" ■Las at =«.. Mature 
Ij^ 225-232. 1382)) arc 293 :et!s oKG-rec iScutrern arc Eerg. j. Mot. Acci. Genet. 1 327-341 , 

'3c2m by ca:cium phcscrate :corec;pitat:cn (Graham ana van oer Eb, icic.). After 43 ncurs. culture media 
were rarvestec and assayea ~zr protein C cy EL!SA. Results are shewn in Table 2. At the same time, 
cji'u-es //ere split 1:5 into media containing 500 u.g mi of G418 (293 ceils) or 250 nM methotrexate 
■tk~3HK ceils;. After 10 cays m the presence of selective media, stably transfected colonies were screened 
for orotein C production by immunofilter assay. 

Positive colonies were picked and grown in selective media (containing 500 ug/ml G418 or 250 nM 
methotrexate, as appropriate) 'or 10 days. Culture media were assayed for APC activity by chromogenic 
assay. Media samoles were aooed to microliter wells containing -00 u\ of 0.2 Spectrozyme PCa (American 
Diagnostica -336) in 50 mM T.-is pH 7.5, 150 mM NaCI. P'ates were incubated at 37* C. and the A— 3 .vas 
measured at various time intervals. Representative results from one transfected 293 cell line 'designated 
829-20) are shown in Figure 11. Media from positive colonies of line 829-20 consistently showed nigher 
acti/r.y with the chrcmogentc substrate for APC than did control media which had been .ncubated with non- 
• ransrected 293 cells for the same length of time (10 days). 

TABLE 2 



TRANSIENT 


EXPRESSION OF 


ACTIVATED 


PROTEIN C (ELISA) 


Ceil 


Protein C 


Line 


ng/ml in Media 


BHK 


2.7 


293 


30 



A DNA seauence encoding an activated protein C precursor with the processing site sequence Arg-Arg- 
L/s-Arg was constructed by mutagenesis of the wild-type protein C sequence. The resultant sequence 
'designated 1058) was analogous to that encoding PC962, but lacked the portion encoding the activation 
peptide fTabie 3). 

The protein C sequence o^esent in plasmid p594 was altered in a single mutagenesis to delete the 
cccons for the activation pept;oe and insert the Arg-Arg cocens at the processing site. Mutagenesis was 
ce-formed en the 870 bp Sst I fragment from p594 essentially as described above using oligonucleotide 
ZC'053 '5' CGC AGT CAC CTG AGA AGA AAA CGA CTC ATT GAT GGG 3'}. 

The mutagenized sequence was used to construct expression vector pDXPClQ58 (analogous to 
oCX PC962) and the vector was co-transfected into BHK cells as described above. The protein was ounfied 
•zr. a polyclonal antibody column eluted with pH 11.5 buffer. 

The activity of the PC 1058 orotein was comoared to that of activated piasma protein C and activated 
PC362. Piasma protein C ana PC962 (5 ag/ml) were activated by treatment with 110 volume Protac C 
■Arr.er.can Diagnostica) for 2 hours. Anticoagulant activity was assayed by combining 50 ul human oiasma 
with 50 ul activated protein C and incubating the mixtures at 37* C for 150 seconds. To this mixture was 
added 50 ul activated ceDhalooiastin (American Scientific Products, McGaw Park, IL), and the mixture was 
incubated at 37' C for 300 seconds. One hundred u\ of 20 mM CaCl2 was aaaed and the clotting time was 
recorded. Data are presented in Figure 12. 



G. Expression of Activated Protein C and KEX2 



A high protein C producing pDX/PC1058 transfected BHK clone (pDX/PCl 058-3//BHK) was identified 
ard transfected with KEX2/Zam229 by the calcium phosonate procedure. Transfected cells were selected 
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with 500 j.z ml G4i 3 and 250 rM meihctrexate. 

A selected clone designated .-<.EX2-1 058' BHK was pulse-labeied with : - S-cysteme in cysteme-rree 
DMEM fGibco) containing 1 % fetal calf serum for 24 hours. The culture media were collected and examined 
for tne presence of singie-cham ar.o cleavea. r.vo-chain activated protein C by ^mmunoprecipitation with a 

= mcnccional antibody to protein C. T.vc hundred fifty u.1 of meaia was combined .vith 10 ug of antibody and 
the mixture was mcuoatec 1 at 37 C for one hour. One hundred al of Staph A cell suspension (Pharmacia. 
Piscataway, NJ) was added and the mature was incubated at 37* C for one hour. The cells were peileted 
by centrifugation and the pellet was resuspended in 60 ul of gei buffer containing 1% -3-mercaptoethanol. 
The suspension was heated to 100* C for three minutes, then electrophoresed on an SDS-polyacrylamide 

10 gel. Proteins were visualized by autoradiography. The KEX2-1 058//BHK clone showed approximately 100% 
cleavage of the protein into the two-chain form. 

Activated protein C was produced from the KEX2-1 058//BHK clone grown to confiuency in Dulbecco's 
MEM supplemented with 10% fetal calf serum, 250 nM methotrexate and 500 o.g;ml G418. The confluent 
cells were switched to Dulbecco's MEM supplemented with 1 ug/ml fibronecun. 2 ug/ml insulin, 5 ug.ml 

'5 transferrin, 5 ug/ml vitamin K. 1x PSN antibiotic mix (Gibco 600-5640), 2.0 mM L-glutamme, 250 nM 
methotrexate and 500 ug. ml G418. Media was collected every 1 to 2 days over a period of 7 days and was 
frozen at -20* . The frozen media samples were thawed and filtered through a 0.45 am filter to remove any 
cell debris. Solid calcium chloride was added to a final concentration of 5 mM and sofid sodium azide was 
added to a final concentration of 0.02% (W/V). Activated protein C was removed from the media using a 

20 monoclonal antibody column specific for the caicium-tnduced conformation of protein C. The treated media 
were applied to the column, and activated protein C was eluted with TBS containing 10 mM EDTA, Protein 
C concentration was determined by absorbance at 280 nm and by ELISA. 

Protein C activity was measured by coagulation assay. Affinity purified plasma protein C was incubated 
with ACC-C (Agkistrodon contortrix contortrix protease [Kisiel et aL, J. Biol. Chem. 262:12607-12613, 1987] 

25 obtained from W. Kisiel, University of New Mexico, Albuquerque, N.M.) diluted in 50 mM Tris, 100 mM 
NaCI and 0.1% bovine serum aibumm at a ratio of 500:1 (APC:ACC-C) for 2 hours at 37' C. Affinity purified 
activated protein C from KEX2-1 058//BHK cells was incubated for 2 hours at 37' C. Clot formation was 
measured in an MLA Electra 800 Coagulation Timer (Medical Laboratory Automation, Inc., Pleasantville, 
NY). One hundred i±\ of activated plasma protein C or KEX2-1058 activated protein C was added to an MLA 

30 cuvette and warmed for 50 seconds to raise the temperature to 37* C. One hundred ul of Dade Actin FS 
(American Scientific Products) was added and the test solutions were incubated for 100 sec. One hundred 
ul of 5 mM CaCb was added to each cuvette. The time required for clot formation was measured. Results 
of coagulation assays shewed that activated protein C produced by KEX2-1 058//BHK cells is approximately 
100% active relative to activated olasma protein C. 

35 The carooxy-terminai sequence of the lignt chain of KEX2-1058 protein C was compared to the 

carboxy-terminai sequence of the light chain of commercially available protein C (American Diagnostica) 
using CNBr cleavage at the umaue methionine residue of the light chain to liberate a peptide which could 
be sequenced in Its entirety oy N-rerminal seauence analysis. Affinity purified orotem C from KEX2- 
1058/-BHK cells grown in Dulbecco's MEM supplemented with 1% fetal calf serum, 250 nM methotrexate 
and 500 ag/ml G418 was first reduced by the addition of a 10-fo!d molar excess (per Cys residue) of 
dithiotthreitoi (DTT) in 0 2 M Tris-HCI. pH 8.3, and guanidine-HC! to a final concentration of 6.0 M. The 
mixture was incubated at 65 C for 4-6 hours, loaoacetic acid, pH 7.0, or icdoacetic amide was added to the 
reduced protein in a four-fold moiar excess over the molar concentration or DTT, and the mixture was 
incubated for 30 minutes at 37' C. The solution was dialized against 0.1 M NHiHC0 3 , pH 8.5 for 24 hours 

45 at 22* C. The dialized solution was apolied to an HPLC Poly-F column (Dupont) to isolate the light chain. A 
500-fold moiar excess of CNBr per methionine residue was added to the purified light chain in 70% formic 
acid under nitrogen for 30 hours at room temperature in the dark. The CNBr digest was applied to an 
American Biosystems Inc. Model 470A sequenator (Marine-on-St. Croix, MN). The resultant sequences 
showed that the C-terminal seouence of both the commercially available purified protein C and the KEX2- 

so 1058 protein ended with Glu. indicating that the light chains of both proteins terminate at amino acid 149. 

H ' C Qnstruct ' on and Expression of p1 645/Zem229R. 

55 

A DNA sequence encoding the amino acids Arg-Arg-Lys was substituted for the DNA sequence 
encoding amino acids 9-11 of the activation peptide encoded by the protein seouence of plasmid p962 
(Table 3: the ammo acids which have been added to the sequence encoding wiid-type protein C appear in 
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:; : r. zrz scaces ce7-ee r a^ re a:.:s a-e .se: sc:ely for a.icn'pg tre : gr7t arc r.eavy ;na:n sequences 7 
iransfeciec wi:n sec:e:ec ac7/a:ea protein C .nio *re cu;ture mecia. P'asmia c962 <vas 

:.ges:ea .v-th 3a; i ana Ss: i ana :r.e ~3C 7C 'ragment was punfiec and <rsened ;rua Mi3rr.Dt0 which haa 
7 ireanzea by c.ces: .on ^ 5a. a:~c Sst i. Synthetic Ouconuceoroes ZC * 545 (5 3AA GAC CAA 

- : ^-a AC A AAA, CGG C" ~" GAT 3 ■ and ZC550 '5' TCC CAG TCA CGA CGT 3'} were used to 
^ a cenze *.ne s;^c ; e-sr a^aec 'e~7.a:e 2\A c r ecarec 7:m tre 'esuitant onage cy sue-cirected : n v;*ro 
■-" -tage. n .es;s iZoller arc S~;:r.. cic- ~'^e ^utant pnage were sucjectea to oiceexv-sequencing to confirm 
mutagenesis. Repiicative ;crm iri DNA £ rom a confirmea mutant clone. cesignatea 1645. was prepared 
anc .vas digestea with Sst i arc Pst i :z .soiate the 411 bp fragment. Piasmia o229 962 (Example 3.E.) was 

*: rgestea wan Ecc Rl anc Pst ! to sc:ate the 592 bp protein C f-agment. Piasmic p229. 962 was also 
digested with Eco Rl and Sst I to iscate the 700 bp protein C fragment. The 411 bp protein C fragment 
'rem the 1645 rf, the 411 od orotem C fragment from p229/962, and the 700 bp protein C fragment were 
.cined m a four-part 'igation with pZem229R which had been linearized with Eco Rl and treated with caif 
;r*es;inal phosphatase tc credent seii- icat.cn. (Plasmid pZem229R is similar to Zem229 except that the Eco 

■5 Pi sites have been destroyed by canai digestion, biunt ending by Klenow fill, religation, subsequent 
: gest:cn with Bam HI and -eugation 3am Hl-Eco Rl adaoters.) A correct plasmid was selected ana was 
designated pPCl645>229R. 

Table 3 



Amino Acid Sequences of Cleavage-Site Mutants 
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170 




WT (594) 


-S-H-L- 


K-R-D-T- 


E-D-Q- 


E-D-Q-V-D-P-R- 


L-I-D-- 




829 


-S-H-L- 


K-R- 






L-I-D-- 


20 


962 


-S-H-L- 


R-R-K-R-D-T- 


E-D-Q- 


E-D-Q-V-D-P-R- 


L-I-D-- 




1058 


-S-H-L- 


R-R-K-R- 






L-I-D- 




1645 


-S-H-L- 


R-R-K-R-D-T- 


E-D-Q- 


E-D-Q-R-R-K-R- 


L-I-D- 


■ 5 


1880 


-S-H-L- 


R-R-K-R-D-T 




D-Q-R-R-K-R- 


L-I-D- 


1953 


-S-H-L- 


R-R-K-R- 




R-R-K-R- 


L-I-D- 




1954 


-S-H-L- 


R-R-K-R-D- 




Q-R-R-K-R- 


L-I-D- 



P'asmid pPCl645;229P -vas trarsfected into tk~ BHK cells oy calcium pnosohate coorecipitation 
'Graram and van der Eb. ac ... Transacted ceiis were subjected to selection with 1 uM methotrexate ana 
media were assayed for protein C by EL1SA (Example 3.E.). A positive clone was grown in Dulbecco's MEM 
suoc emented with i0 o/ c : etai calf serum and 1 llM methotrexate until the ceiis reached confluency. The 
c:nfiuent ceils were switcnec tc Zucecco's MEM supplemented with 1 °o fetal caif serum and 1 uM 
methotrexate. Media were coiiectec every 1 to 2 days over a period of 7 days and frozen at -20 C. The 
frozen media samples were thawea ana filtered through a 0.45 urn filter to remove any cell debris. Solid 
calcium chloride was added tc a fin ai concentration of 5 mM and solid sodium azide was added to a final 
concentration of 0.02% (W/V) protein C was recovered from the media using a monoclonal antibody 
column specific for the calcium-induced conformation of protein C. The treated media were applied to the 
oo'umn. and protein C was eiuteo with TBS containing 10 mM EDTA. Protein C concentration was 
-etermined by absorbance at 230 nm and by ELISA. 

Activated protein C produced frcm pPCl 645/229 transfected cells was comoared to an equivalent 
amount of PC229/962 protein C produced from transfected cells using a chromogenic assay. One ug of 
affinity purified protein C diluted in 40 ul TBS + EDTA was added to each well of a 96-well plate. Forty ul 
55 of 2 mM Spectrozyme PCa (American Diagnostica Inc, New York. NY) was added to each well and 
pcucated at 37 *C until there was sufficient color development. Activity was measured as an increase in 
acscrbance at 405 nm. The results snowed that the activated orotein C oroduced from pPC1 645/229 
transfected cells was 5-1 0% more active than the PC229/962-proaucea protein C. 



20 



EP 0 319 944 A2 



!. Construction a-c Expression cf oPCl 680. 229R 



The DNA sequence encoding prctein C >n piasmid 1345 was further modified to remove the first, 
5 second, seventh ana eight amino acids cf tne activation peotide (Table 3). Single-strandea 1545 template 
DNA -vas prepared ana was suojecteG to site-directed in vitrc mutagenesis (Zoller and Smith, ibid.) using 
synthetic oiigcnuc:eoiides ZC1380 -s'AAA CGA GAC ACA GAC CAA AGA AGA 3 # ) ana ZC550. Positive 
clones were subjected to dideoxy sequencing to confirm the mutagenesis. A positive clone was identified 
and was designated 1880, 

10 Replicative form DNA prepared from clone 1880 was digested with Sst I and Pst t to isolate the 411 bp 

fragment. Piasmid PC229/962 was digested with Eco Rl and Pst I to isolate the 562 bp protein C fragment. 
Piasmid PC229/962 was also digested with Eco Rl and Pst i to isolate the 700 bp protein C fragment. The 
411 bp protein C fragment from the 1880 rf and the 700 bp and 562 bp fragments from PC229/962 were 
joined with pZem229R, which had been digested with Eco Rl, in a four-part ligation. A correct piasmid was 

r5 selected and was designated pPCl 880/229R. 

Piasmid pPCl880/229R was transfected into tk" BHK cells and assayed as previously described. 

J. Construction ana Expression of pPC1954/229R 

20 

The coding sequence of the activation peptide present in piasmid 1645 is altered to remove the second 
through seventh ammo acid codons of the activation peptide, resulting in a fusion between the first and 
eighth amino acid codons of the activation peptide present in 1645. Single-stranded 1645 template DNA is 

25 prepared and is subjected to site directed in vitro mutagenesis using the synthetic oligonucleotides ZC1954 
(5' GAG AAG AAA ACG AGA CCA AAG AAG AAA AC 3 # ) and ZC550. Positive clones are sequenced to 
confirm the mutagenesis. A positive clone is selected and is designated 1954. The amino acid sequence at 
the junction of the light and heavy chains of the encoded protein is shown in Table 3. 

Replicative form DNA is prepared from clone 1954 and is digested with Sst I and Pst I to isolate the 

30 approximately 400 bp mutagenized protein C fragment. Piasmid pPC229/962 is digested with Eco Rl and 
Pst i and with Sst I and Eco Ri to isolate the 562 bp Eco Rl-Pst I fragment and the 700 bp protein C 
fragment, respectively. The approximately 400 bp protein C fragment from the 1954 rf and the 700 bp and 
562 bp fragments from PC229/962 are joined with pZem229R, which has been digested with Eco Rl, in a 
four-part ligation. 

35 A correct plasmic is selected and is designated pPC1954/229R. 

Piasmid pPCl954/229R is transfected into tk" BHK cells by calcium phosphate coprecipitation. Cells 
are selected and assayed as previously described. 

40 K. Construction and Exoression of pPCl953/229R 

The coding sequence of the activation peptide present in piasmid 1645 is altered to remove the first 
through eighth amino acid codons of the activation peptide, resulting in a fusion between the first and 

45 second sets of Arg-Arg-Lys-Arg amino acid codons present in 1645. Single-stranded 1645 template DNA is 
prepared and is subjected to site directed in vitro mutagenesis using the synthetic oligonucleotides ZC1953 
(5' ACC TCA GAA GAA AAC GAA GAA"GAA AAC GGC TCA T 3') and ZC550. Positive clones are 
sequenced to confirm the mutagenesis, A positive clone is selected and is designated 1953. The amino 
acid sequence at the light chain-heavy chain junction of the encoded protein is shown in Table 3. 

50 Replicative form DNA is prepared from clone 1953 and is digested with Sst I and Pst I to isolate the 

approximately 400 bp mutagenized protein C fragment. Piasmid PC229/962 is digested with Eco Rl and Pst 
1 and with Sst I and Eco Rl to isolate the 562 bp Eco Rl-Pst 1 fragment and the 700 bp protein C fragment, 
respectively. The approximately 400 bp protein C fragment from the 1953 rf and the 700 bp and 562 bp 
fragments from PC229/962 are joined with pZem229R, which has been digested with Eco Rl, in a four-part 

55 ligation. A correct piasmid is selected and is designated pPCl 953/229R. Piasmid pPC1953/229R is 
transfected into tk~BHK cells by calcium phosphate coprecipitation. Cells are selected and assayed as 
previously described. 
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Cc-e:*'cress:cr cr Ccagu: = !ion ractcrs VM arc "X 



= : asrr.id FIXcD2 (Busby et ai.. Nature 316.271-273. 1985) was aigestea ,vith. Bam HI, ana the 1.4 kb 
actor iX fragment was recovered. This fragment was then joined to pD5 wmcn hac been cigestea with 
;am HI and treated with calf intestinal phosphatase. The resultant plasmid was designated FIXpD5 . 

To co-express factor VII and factor IX, 10 ug of FIX^pDS . 10 ug of FVII(565-2463VpDX (Hagen et al., 
:P 200.421: ATCC 40205) and 1 j,g of DHFRreS-pD5' (a pD5 -derived piasmid containing a methotrexate 
es stant DHFR gene) [Levinson et a. , EP 1 17,060]) were usea to transfect tk~ 3HK ceils. Transfected cells 
'-6*e selected in the presence of methotrexate, then assayed for production of factor VII and factor IX by 
mm-jrofiiter assay using antibodies to both oroteins. Colonies which were positive : or both factor VII and 
act.v iX production were selectee, grown up and pulsed with 25 S-cysteme. Culture media and intracellular 
crctems *ere immunoprecipitated with the factor VII and IX antibodies and analyzed by poiyacrylamide gel 
riecttcphcresis. Celts co-expressing factor VII and factor IX produced two-chain factor VII (i.e. factor Vila), 
n ciPtrast. ceils producing only factor VII showed only the single chain fcrm of the protein (Figure 13). 

Example 5 



Co-expression in Saccharomyces cerevisiae 



Fhe S. cerevisiae BAR1 gene encodes a secreted protein known as Barrier. The secretory peptide 
cortion ofThe BAR1 gene product or the secretory peptide pius the third (C-termmal) domain of Barrier may 
:e used to facilitate the secretion of 'oreign proteins produced in S. cerevisiae (MacKay. WO 87 02670 and 
Mac Kay et al., European Patent Application 88116335.6, both of which are incorporated herein by reference 
n their entirety). 

As described in European Patent Application 88116335.6, an expression vector containing the sequence 
ercrcing the signal peptide and third domain of Barrier fused to the coding sequence for the B(1-29)Ala- 
A!a-L/s-AM-21 ) insulin precursor iMarkussen et at., EP 163,529; hereinafter referred to as MI-3) was 
;orst"jcted. This vector, designated pSW195 (Figure 14), also contains the yeast TPI1 promoter and 
:orrrinator. The Barrier and insulin sequences are joined at the amino acid sequence lys-arg. To allow 
ororessing of the fusion protein by thrombin, the lys-arg junction seauence was mutagemzed to pro-arg. 
Cleavage at this site by thromom wil. result in secretion of the unfused insuiin precursor. 

As shown in Figure 14, plasmid pSW195 was digested with Sph I and Sal I to isolate the 1.7 kb 
: 'agr-.ent comprising the BAR1 insulin fusion and the TPI1 terminator. This fragment was ligated with 
\1i3mol8 which had been previously digested to completion with Sph I and Sal I. The resultant phage 
:.cre .vas designated mpl8-ZV172. Oligonucleotide ZC1083 (5' TCC TTG GAT CCA AGA TTC GTT 3') 
.vas used to mutagenize mp18-ZV172 using the uracil method (Kunkel, Prcc. Nat'l Acad. Sci. USA 82:488- 
492. 1985). The resultant mutants were sequenced to confirm the mutagenesis and a positive clone was 
ties grated ZV172 / 1083. For convenience, the insert present in ZV172'1083 was subcloned into pUC18. The 
1.7 <o Sph l-Sat I insert from ZV1 72/1 083 was isolated and ligated with pUCl8 which had been previously 
digested to completion with Sph I and Sal I. The resultant plasmid. pZV130, was digested to completion 
■vi : u Sal L The adhesive ends of the linearized pZV180 were blunted using DNA polymerase I (Klenow 
Tagment) and ligated to kinased Bgl II linkers. Excess linkers were removed by digestion with Bgl II. The 
tinkered DNA was then cut to completion with Sph I to isolate the 1.7 kb insert. The 1.7 kb insert, 
comprising the partial TPI1 promoter, the BAR1-MI-3 fusion and the TPI1 terminator, was ligated into the 
Sph l-Bam HI partial TPI1 promoter-vector fragment of plasmid pSW207 to construct pZV187. (pSW207 
*as derived from pCPOT [ATCC 39685] by replacing the 750 bp Sph !-8am HI fragment of pCPOT 
:cntaming 2 micron and pBR322 sequences with a 186 bp Sph l-Bam HI fragment derived from the 
cBP322 tetracaine resistance gene; destroying the Sph I site and inserting a Not I site in the tetracycline 
'esistance gene: digesting the resultant plasmid with Not i and Bam Hi; ana inserting a Not l-Bam HI TPI1 
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promoter fragment in place r ;ne Not i-Bam HI pBR322-danveci sequence.) 

The thromcin cDNA s sciatea 'rem a prcthromoin cDNA cloned in the Pst I site of p8R322 fFriezner- 
Degan et at., ibid.). The KEX2 gene : rom pi 51 5 (Example 3) is Digested and mantouiated to remove the 
catalytic domain and the -err.ainirg sequences are joined to the thromom cDNA. An expression unit is 

5 preDared by joining me "vena gene to the TPI1 promoter and terminator. This expression unit is then 
substituted for the coding region of me yeast 3AR1 gene. The resultant construct, comprising the 
expression unit flanked cy BARl gene non-coding sequences, is used to transforms, cerevisiae. The 
transformed cells are cuiturea ana screened for production of thrombin by enzyme activity assay. A colony 
positive for thrombin production is tnen transformed with pZV187). Cells are cultured and the insulin 

jo precursor is isolated from the media. 

From the foregoing it will be appreciated that, although specific embodiments of the invention have 
been described herein for purposes of illustration, various modification may be made without deviating from 
the spirit and scope of the invention. Accordingly, the invention is not to be limited except as be the 
appended claims. 

>5 ATCC 20699, 39685, 53347 ana 67569 are ail deposi ts made under the Budapest Treaty arc were 

made on 13 April 1984, 9 Vlay 1984, 10 December 1985 and 3 December 1987, respectively. 
United States Patent Acpiication No. 053412 is available 

The features disctosec in the foregoing description, in the following ciaims and/or in the accompanying 
drawings may, both separately and in any combination thereof, be material for realising the invention in 
20 diverse forms thereof. 



Claims 

25 1. A eukaryotic host ceil transacted or transformed with a first DNA sequence encoding a protein of 
interest and at least one adcitional DNA sequence, said additional DNA sequence encoding a protein which 
processes or stabilizes the orotein of interest. 

2. The host cell of c:aim 1 wnerem the eukaryotic host cell is a mammalian host cell or a yeast host 

cell. 

30 3. The host cell of c:aim 1 wherein the first DNA sequence is setected from the group consisting of 

sequences encoding t-PA factor V!l. factor IX, factor X, protein C, activated protein C, protein S, 
plasminogen, insulin and derivatives and analogs thereof. 

4. The host cell of claim i wherein the first DNA sequence encodes a serine protease. 

5. The host ceil of c:aim i therein the additional DNA sequence is selected from the group consisting 
35 of sequences encoding proteases, crotease inhibitors, and proteins which bind to the protein of interest. 

6. The host cell of c:aim t wherein the first DNA sequence encodes factor VII and the additional DNA 
sequence encodes factor ix. 

7. The host ceil of c a.^ 1 /vrerem the first DNA sequence encodes protein C or activated protein C 
and the additional DNA secuence is the yeast KEX2 gene. 

jo 8. The host cell of cam i wnerein the first DNA sequence encodes t-PA and the additional DNA 
sequence encodes a protease mrioitor selected from the group consisting of TfMP, trypsin inhibitors and 
aprotinin. 

9. The host cell of o a>m 1 wnerem the first DNA sequence encodes plasminogen and the additional 
DNA sequence encodes a. era- 1 -antitrypsin or a variant thereof. 
45 10. The host cell of o a ; m 1 wherein the first DNA sequence encodes protein C and the additional DNA 

sequence encodes protein S. 

11. The host cell of c:3im 1 therein the first DNA sequence encodes factor VII and the additional DNA 
sequence encodes tissue 'actor. 

12. A method for procucmg a protein of interest, comprising: 

so culturing a host cell accorcing to any of claims 1-11 under conditions which allow the first DNA sequence 
and the additional DNA secuence(s) to be expressed; and 
isolating the protein of interest from the host cell. 

13. The method of claim 12 including, prior to the step of culturing, cotransfecting or cotransforming the 
host cell with multiple vectors, each containing a separate expression unit. 

55 14. The method of claim 12 including, prior to the step of culturing, transfecting or transforming the host 

cell with a single vector containing multiple expression units. 



23 



EP 0 319 944 A2 



'3 ~he '~e:hcc z i z 3:-n '2 -e-^c;rg. c ;r :z z:sz zi OLi.ter — c. trarsrectrg me ~es: :el A'i*m a 
r -g e .e::cr zcntammg a s;rg:e e.-.c-ess-o^ -mit ma: :s t'arsonce:: rtc a co.yoistror.: message, .vr.sre.n 
-~.es: :eii is a mam man an oen. 

'5. A DNA sequence .%ric n codes l z- a cmtem x avmg substantia. iy the same cicegica 1 activity as 
-■./-an activated crctem C. saic secuence ; -::er cec;r-g ;er me amine ac;C sequence -P:-P — ru-X-R= - 
- -P--P:-H. .vrerein L s essentia::-/ me em rham er eefvatea protein C. R--R = are L/s er Arg X s a 
eectfce eenc cr a scacer cec:,ee c* 1-2 ammo ac:cs, a.ne H is essentially the neavy cnam ef activatea 
omtem C. 

'7. Mammalian ce;!s transfected with an exoression vector caoacle cf integration m mammalian host 
eel DNA saia expression vector oomDnsmg a promoter eperaoiy linked to a DNA sequence according to 
claim 16. 

18. The mammalian ceils of claim 17 therein said cells are further transfected with the KEX1 or KEX2 
gene of Saccharomyces cerevisiae. 
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10 20 36 51 

GCACTGCTGG CCAGTCCCAA A AT G G A A CAT AAG GAA GTG GTT CTT CTA CTT CT i TTA 

MET Glu His Lys Glu Val Val Leu Leu Leu Leu Leu 

66 9i 96 lM 

TTT CTG AAA TCA GGT C A A GGA GAG CCT CTG GAT G AC TAT GTG AAT ACC CAG GGG 
Phe Leu Lys Ser Gly Lin Gly Glu Pro Leu Asp Asp Tyr Val Asn Thr Gin Gly 

126 1^1 156 

GCT TCA CTG TTC AGT GTC ACT AAG AAG CAG CTG GGA GCA GGA AGT ATA GAA GAA 
Ala Ser Leu Phe Ser Val Thr Lys Lys Gin Leu Gly Ala Gly Ser Me Glu Glu 

l7 l 166 201 216 

TGT GCA GCA AAA TGT GAG GAG GAC GAA GAA TTC ACC TGC AGG GCA TTC CAA TAT 
Cys Ala Ala Lys Cys Glu Glu Asp Glu Glu Phe Tyr Cys Arg Ala Phe Gin Tyr 

231 2<46 261 

CAC AGT AAA GAG CAA CAA TGT GTG ATA ATG GCT GAA AAC AGG AAG TCC TCC ATA 
His Ser Lys Glu Gin Gin Cys Val lie MET Ala Glu Asn Arg Lys Ser Ser lie 

276 2*1 306 321 

ATC ATT AGG ATG AG A GAT GTA GTT TTA TTT GAA AAG AAA GTG TAT CTC TCA GAG 
lie lie Arg MET Arg Asp Val Val Leu Phe Glu Lys Lys Val Tyr Leu Ser Glu 

336 35 1 366 39 1 

TGC AAG ACT GGG AAT GGA AAG AAC TAC AGA GGG 2CG ATG TCC AAA ACA AAA AAT 
Cys Lys Thr Gly Asn Gly Lys Asn Tyr Arg Gly Thr MET Ser Lys Thr Lys Asn 

396 <* 26 
GGC ATC ACC TGT CAA AAA TGG AGT TCC ACT TCT CCC CAC AGA CCT AGA TTC TCA 
Gly lie Thr Cys Gin Lys Trp Ser Ser Thr Ser Pro His Arg Pro Arg Phe Ser 

i+t+l ^56 < *7 1 

CCT GCT ACA CAC CCC TCA GAG GGA CTG GAG GAG AAC TAC TGC AGA AAT CCA GAC 
Pro Ala Thr His Pro Ser Glu Gly Leu Glu Glu Asn Tyr Cys Arg Asn Pro Asp 

501 516 531 

AAC GAT CCG CAG GGG CCC TGG TGC TAT ACT ACT GAT CCA GAA AAG AGA TAT GAC 
Asn Asp Pro Gin Gly Pro Trp Cys Tyr Thr Thr Asp Pro Glu Lys Arg Tyr Asp 

5^6 561 576 59 1 

.TAC TGC GAC ATT CTT GAG TGT GAA GAG GAA TGT ATG CAT TGC AGT GO A GAA AAC 
Tyr Cys Asp lie Leu Glu Cys Glu Glu Glu Cys MET His Cys Ser Gly Gly Asn 

606 621 636 651 

TAT GAC GGC AAA ATT TCC AAG ACC ATG TCT GGA CTG GAA TGC CAG GCC TGG GAC 
Tyr Asp Gly Lys lie Ser Lys Thr MET Ser Gly Leu Glu Cys Gin Als Trp Asp 

666 681 696 

TCT CAG AGC CCA CAC GCT CAT GGA TAC ATT CCT TCC AAA TTT CCA AAC AAG AAC 
Ser Gin Ser Pro His Ala His Gly Tyr He Pro Ser Cys Phe Pro Asn Lys Asn 
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'~T=T\ ' 726 7<tl 756 

CTG AAG AAG AAT T AC TGT CGT AAC CCC GAG AGG GAG CTG CGC CCT TGG TGT TTC 
Leu Lys Lys Asn Tyr Cys Arg Asn Pro Glu Arg Glu Leu Arg Pro Trp Cys Phe 

771 786 B01 

ACC ACC GAG CCC A AC AAG GCG TGG GAA CTT TGT GAC ATC CCC CGC TGC ACA ACA 
Tnr Thr Asp Pro Asn Lys Arg Trp Glu Leu Cys Asp lie Pro Arg Cys Thr Thr 

B16 B31 B<*6 B61 

CCT CCA CCA TCT TCT GGT CCC ACC TAC CAG TGT CTG AAG GGA ACA GCT GAA AAC 

Pro Pro Pro Ser Ser Gly Pro Thr Tyr Gin Cys Leu Lys Gly Thr Gly Glu Asn 

876 S91 906 921 

TAT CGC GGG AAT GTG GCT GTT ACC CTG TCC GGG CAC ACC TGT CAG CAC TGG AGT 
Tyr Arg Gly Asn Val Ala Val Thr Val Ser Gly His Thr Cys Gin His Trp Ser 

936 951 ^66 

GCA CAG ACC CCT CAC ACA CAT AAC AGG ACA CCA GAA AAC TTC CCC TGC AAA AAT 
Ala Gin Thr Pro His Thr His Asn Arg Thr Pro Glu Asn Phe Pro Cys Lys Asn 

9 Ql 996 1011 1026 

TTG GAT GAA AAC TAC TGC CGC AAT CCT GAC GGA AAA AGG GCC CCA TGG TGC CAT 
Leu Asp Glu Asn Tyr Cys Arg Asn Pro Asp Gly Lys Arg Ala Pro Trp Cys His 

1<*0 1 1056 1071 

ACA ACC AAC AGC C A A GTG CGG TGG GAG TAC TGT AAG ATA CCG TCC TGT GAC TCC 
Thr Thr Asn Ser Gin Val Arg Trp Glu Tyr Cys Lys lie Pro Ser Cys Asp Ser 

1086 1101 1116 1131 

TCC CCA GTA TCC ACG GAA CAA TTG GCT CCC ACA GCA CCA CCT GAG CTA ACC CCT 
Ser Pro Val Ser Thr Glu Gin Leu Ala Pro Thr Ala Pro Pro Glu Leu Thr Pro 

U<»6 M61 1176 1191 

GTG GTC CAG GAC TGC TAC CAT GGT GAT GGA CAG AGC TAC CGA GGC ACA TCC TCC 
Val Val Gin Asp Cys Tyr His Gly Asp Gly Gin Sre Tyr Arg Gly Thr Ser Ser 

1206 1221 1236 

ACC ACC ACC ACA G5A AAG AAG TGT CAG TCT TGG TCA TCT ATG ACA CCA CAC CGG 
Thr Thr Thr Thr Gly Lys Lys Cys Gin Ser Trp Ser Ser MET Thr Pro His Arg 

1251 12<bfc 1291 1296 

CAC CAG AAG ACC CCA GAA AAC TAC CCA AAT GCT GGC CTG ACA ATG AAC TAC TGC 
His Gin Lys Thr Pro Glu Asn Tyr Pro Asn Ala Gly Leu Thr MET Asn Tyr Cys 

1311 1326 13^1 

AGG AAT CCA GAT GCC GAT AAA GGC CCC TGG TGT TTT ACC ACA GAC CCC AGC GTC 
Arg Asn Pro Asp Ala Asp Lys Gly Pro Trp Cys Phe Thr Thr Asp Pro Ser Val 

1356 1371 13B6 l^Ol 

AGG TGG GAG TAC TGC AAC CTG AAA AAA TGC TCA GGA ACA GAA GCG AGT GTT GTA 
Arg Trp Glu Tyr Cys Asn Leu Lys Lys Cys Ser Gly Thr Glu Ala Ser Val Va l 
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1^16 1^3 1 1 l ^ 61 

GCA CCT CCG CCT GTT GTC CTG CTT CCA GAT GTA GAG ACT CCT TCC GAA GAA GAC 
Aid Pro Pro Pro Val Val Leu Leu Pro Asp Val Glu Thr Pro Ser Glu Glu Asp 

TGT ATG TTT GGG AAT GGG AAA GGA T AC CGA GGC AAG AGG GCG ACC ACT GTT ACT 
Cys MET Phe Gly Asn Gly Lys Gly Tyr Arg Gly Lys Arg Ala Thr Thr Val Thr 

l52 l 1536 1551 1566 

GGG ACG CCA TGC C AG G AC TGG GCT GCC CAG GAG CCC CAT AG A CAC AGC ATT TTC 
Gly Thr Pro Cys Glu Asp Trp Ala Ala Gin Glu Pro His Arg His Ser lie Phe 

1581 1596 14*^ 

ACT CCA GAG AC A AAT CCA CGG GCG GGT CTG GAA AAA AAT TAC TGC CGT AAC CCT 
XHr Pro Glu Thr Asn Pro Arg Ala Gly Leu Glu Lys Asn Tyr Cys Arg Asn Pro 

1626 16M 1^5^ 1671 

GAT GGT GAT GTA GGT GGT CCC TGG TGC TAC ACG ACA AAT CCA AG A AAA CTT TAC 
Asp Gly Asp Val Gly Gly Pro Trp Cys Tyr Thr Thr Asn Pro Arg Lys Leu Tyr 

16B6 1701 1716 1731 

GAC TAC TGT GAT GTC CCT CAG TGT GCG GCC CCT TCA TTT GAT TGT GGG AAG CCT 
Asp Tyr Cys Asp Val Pro Gin Cys Ala Ala Pro Ser Phe Asp Cys Gly Lys Pro 

17^6 1761 ^776 

CAA GTG GAG CCG AAG AAA TGT CCT GGA AGG GTT GTA GGG GGG TGT GTG GCC CAC 
Gin Val Glu Pro Lys Lys Cys Pro Gly Arg Val Val Gly Gly Cys Val Ala His 

1791 1006 1821 1936 

CCA CAT TCC TGG CCC TGG CAA GTC AGT CTT AG A ACA AGG TTT GGA ATG CAC TTC 
Pro His Ser Trp Pro Trp Gin Val Ser Leu Arg Thr Arg Phe Gly MET His Phe 

1B51 1866 ieai 

TCT GGA GGC ACC TTG ATA TCC CCA CAG TGG GTG TTG ACT GCT GCC CAC TGC TTG 
Cys Gly Gly Thr Leu lie Ser Pro Glu Trp Val Leu Thr Ala Ala His Cys Leu 

lB 96 1911 1926 1^1 

GAG AAG TCC CCA AGG CCT TCA TCC TAC AAG GTC ATC CTG GGT GCA CAC CAA GAA 
Glu Lys Ser Pro Arg Pro Ser Ser Tyr Lys Val lie Leu Gly Ala His Gin Glu 

1956 1971 1986 HOO 1 

GTG AAT CTC GAA CCG CAT GTT CAG GAA ATA GAA GTG TCT AGG CTG TTG TTG GAG 
Val Asn Leu Glu Pro His Val Gin Glu lie Glu Val Ser Arg Leu Phe Leu Glu 

2016 2031 20^6 

CCC ACA CGA AAA GAT ATT GCC TTG CTA AAG CTA AGC AGT CCT GCC GTC ATC ACT 
Pro Thr Arg Lys Asp lie Ala Leu Leu Lys Leu Ser Ser Pro Ala Val lie Thr 

2061 2076 2091 2106 

GAC AAA GTA ATC CCA GCT TGT CTG CCA TCC CCA AAT TAT GTG GTC GCT GAC CGG 
Asp Lys Val lie Pro Ala Cys L eu Pro Ser Pr o Asn Tyr Val V a l Ala Asp Ar g 
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5121 2136 2151 

ACC GAA TGT TTC ATC ACT GGC TGG GGA GAA ACC CAA GCT ACT TTT GGA GCT GGC 

Thr Glu Cys Phe lie Thr Gly Trp Gly Glu Thr Gin Gly Thr Phe Gly Ala Gly 

2166 21B1 2196 2211 

CTT CTC AAG GAA GCC C AG CTC CCT GTG ATT GAG AAT AAA GTG TGC AAT CGC TAT 

Leu Leu Lys Glu Ala Gin Leu Pro Val lie Glu Asm Lys Val Cys Asn Arg Tyr 

2226 22<*1 2256 227 1 

GAG TTT CTG AAT GGA AGA GTC CAA TCC ACC GAA CTC TGT GCT GGG CAT TTG GCC 

Glu Phe Leu Asn Gly Arg Val Gin Ser Thr Glu Leu Cys Ala Gly His Leu Ala 

2286 2301 2316 

GGA GGC ACT GAC AGT TGC CAG GGT GAC AGT GGA GGT CCT CTG GTT TGC TTC GAG 

Gly Gly Thr Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Phe Glu 

2331 23^6 2361 2376 

AAG GAC AAA TAC ATT TTA CAA GGA GTC ACT TCT TGG GGT CTT GGC TGT GCA CGC 

Lys Asp Lys Tyr He Leu Gin Gly Val Thr Ser Trp Gly Leu Gly Cys Ala Arg 

2391 2^06 2<*2l 

CCC AAT AAG CCT GGT GTC TAT GTT CGT GTT TCA AGG TTT GTT ACT TGG ATT GAG 

Pro Asn Lys Pro Gly Val Tyr Val Arg Val Ser Arg Phe Val Thr Trp. lie Glu 

2^36 2^+51 2^6^ 2^7<+ 2**B<+ 2<+9<« 

GGA GTG ATG AGA AAT AAT T A A TTGGACGGGA GACAG AGTGA CGCACT G ACT CACCT AGAGG 
Gly Val MET Arg Asn Asn 

250<* - 251^ 252<* 253^ 25^ 255<+ 256<* 

CTGGAACGAG GGTAGGGATT TAGCATGCTG GAAAT AACTG GCAGTAATCA AACGAAGACA CTGTCCCCAG 

257^ 25B<* 259<* 260<* 261<* 262<* 263<* 

CTACCAGCTA CGCCAAACCT CGGCATTTTT TGTGTTATTT TCTGACT GCT GGATTCTGT A GTAAGGTGAC 



26<^ H65<t 266^ 267<* 

ATAGCTATGA CATTTGTTAA AAAT AAACTC TGTACTTAAC TTTGA 
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